DNA mutagenesis method.

A method of DNA mutagenesis comprises
(a) obtaining a DNA moiety encoding at least a portion of
said precursor protein;

(b) identifying a region within the moiety;

(c) substituting nucleotides for those already existing within
the region in order to create at least one restriction enzyme site
unique to the moiety, whereby unique restriction sites 5' and 3' to
the identified region are made available such that neither alters
the amino acids coded for by the region as expressed;

(d) synthesizing a plurality of oligonucleotides, the 5' and 3'
ends of which each contain sequences capable of annealing to
the restriction enzyme sites introduced in step (c) and which,
when ligated to the moiety, are expressed as substitutions,
deletions and/or insertions of at least one amino acid in or into
said precursor protein;

(e) digesting the moiety of step (c) with restriction enzymes
capable of cleaving the unique sites;

(f) ligating each of the oligonucleotides of step (d) into the
digested moiety of step (e) whereby a plurality of mutant DNA
moieties are obtained; and optionally the further steps of

(g) expressing each of said moieties as a mutant protein in a
suitable host;

(h) recovering the mutant proteins of step (g); and

(i) screening the step (h) mutant proteins for the desirable
characteristic.

DNA MUTAGENESIS METHOD



This invention relates to the production and 
manipulation of proteins using recombinant techniques in 
suitable hosts; more specifically to the directed
mutagenesis of enzymes in order to modify the characteristics
thereof.

Various bacteria are known to secrete proteases at 
some stage in their life cycles. Bacillus species produce 
two major extracellular proteases, a neutral protease (a 
metalloprotease inhibited by EDTA) and an alkaline protease
(or subtilisin, a serine endoprotease). Both generally are
produced in greatest quantity after the exponential growth
phase, when the culture enters stationary phase
and begins the process of sporulation. The physiological role of
these two proteases is not clear. They have been postulated to play
a role in sporulation (J. Hoch, 1976, "Adv. Genet." 18:69-98;
P. Piggot et al., 1976, "Bact. Rev." 40:908-962; and F. Priest,
1977, "Bact. Rev." 41:711-753), to be involved in the regulation of
cell wall turnover (L. Jolliffe et al., 1980, "J. Bact."
141:1199-1208), and to be scavenger enzymes (Priest, Id.). The
regulation of expression of the protease genes is complex. They
appear to be coordinately regulated in concert with sporulation,
since mutants blocked in the early stages of sporulation exhibit
reduced levels of both the alkaline and neutral protease.
Additionally, a number of pleiotropic mutations exist which affect
the level of expression of proteases and other secreted gene
products, such as amylase and levansucrase (Priest, Id.).

Subtilisin has found considerable utility in industrial and 
commercial applications (see U.S. Patent No. 3,623,957 and 
J. Millet, 1970, "J. Appl. Bact." 33:207). For example, subtilisins
and other proteases are commonly used in detergents to enable
removal of protein-based stains. They also are used in food
processing to adapt the proteinaceous substances present in
food preparations to their desired effect on the composition.

Classical mutagenesis of bacteria with agents such as radiation 
or chemicals has produced a plethora of mutant strains exhibiting 
different properties with respect to the growth phase at which 
protease excretion occurs as well as the timing and activity levels 
of excreted protease. These strains, however, do not approach the 
ultimate potential of the organisms because the mutagenic process is 
essentially random, with tedious selection and screening required to 
identify organisms which even approach the desired characteristics. 
Further, these mutants are capable of reversion to the parent or 
wild-type strain. In such event the desirable property is lost. 
The probability of reversion is unknown when dealing with random 
mutagenesis since the type and site of mutation is unknown or poorly
characterized. This introduces considerable uncertainty into the
industrial process which is based on the enzyme-synthesizing
bacterium. Finally, classical mutagenesis frequently couples a
desirable phenotype, e.g., low protease levels, with an undesirable
character such as excessive premature cell lysis.

Special problems exist with respect to the proteases which are 
excreted by Bacillus. For one thing, since at least two such 
proteases exist, screening for the loss of only one is difficult.
Additionally, the large number of pleiotropic mutations affecting
both sporulation and protease production makes the isolation of true
protease mutations difficult.

Temperature sensitive mutants of the neutral protease gene have 
been obtained by conventional mutagenic techniques, and were used to
map the position of the regulatory and structural gene in the
Bacillus subtilis chromosome (H. Uehara et al., 1979, "J. Bact."
139:583-590). Additionally, a presumed nonsense mutation of the
alkaline protease gene has been reported (C. Roitsch et al., 1983,
"J. Bact." 155:145-152).

Bacillus temperature sensitive mutants have been isolated that
produce inactive serine protease or greatly reduced levels of serine
protease. These mutants, however, are asporogenous and show a
reversion frequency to the wild-type of about 10^-7 to 10^-8
(F. Priest, Id. p. 719). These mutants are unsatisfactory for the 
recombinant production of heterologous proteins because asporogenous 
mutants tend to lyse during earlier stages of their growth cycle in 
minimal medium than do sporogenic mutants, thereby prematurely
releasing cellular contents (including intracellular proteases) into
the culture supernatant. The possibility of reversion also is
undesirable since wild-type revertants will contaminate the culture
supernatant with excreted proteases.



Bacillus sp. have been proposed for the expression of
heterologous proteins, but the presence of excreted proteases and
the potential resulting hydrolysis of the desired product has
retarded the commercial acceptance of Bacillus as a host for the
expression of heterologous proteins. Bacillus megaterium mutants
have been disclosed that are capable of sporulation and which do not 
express a sporulation-associated protease during growth phases. 
However, the assay employed did not exclude the presence of other 
proteases, and the protease in question is expressed during the 
sporulation phase (C. Loshon et al., 1982, "J. Bact." 150:303-311).
This, of course, is the point at which heterologous protein would 
have accumulated in the culture and be vulnerable. The present 
invention enables the construction of a Bacillus strain that is 
substantially free of extracellular neutral and alkaline protease 
during all phases of its growth cycle and which exhibits
substantially normal sporulation characteristics. A need exists for 
non-revertible, otherwise normal protease deficient organisms that 
can then be transformed with high copy number plasmids for the 
expression of heterologous or homologous proteins. 

Enzymes having characteristics which vary from available stock 
are required. In particular, enzymes having enhanced oxidation 
stability will be useful in extending the shelf life and bleach
compatibility of proteases used in laundry products. Similarly,
reduced oxidation stability would be useful in industrial processes
that require the rapid and efficient quenching of enzymatic activity. 

Modifying the pH-activity profiles of an enzyme would be useful
in making the enzymes more efficient in a wide variety of processes,
e.g. broadening the pH-activity profile of a protease would produce
an enzyme more suitable for both alkaline and neutral laundry 
products. Narrowing the profile, particularly when combined with 
tailored substrate specificity, would make enzymes in a mixture more 
compatible, as will be further described herein. 

Mutations of procaryotic carbonyl hydrolases (principally
proteases but including lipases) will facilitate preparation of a
variety of different hydrolases, particularly those having other
modified properties such as Km, Kcat, Kcat/Km ratio and substrate
specificity. These enzymes can then be tailored for the particular
substrate which is anticipated to be present, for example in the 
preparation of peptides or for hydrolytic processes such as laundry 
uses. 

Chemical modification of enzymes is known. For example, see I.
Svendsen, 1976, "Carlsberg Res. Commun." 41 (5): 237-291. These
methods, however, suffer from the disadvantages of being dependent
upon the presence of convenient amino acid residues, are frequently
nonspecific in that they modify all accessible residues with common
side chains, and are not capable of reaching inaccessible amino acid
residues without further processing, e.g. denaturation, that is
generally not completely reversible in reinstituting activity. To
the extent that such methods have the objective of replacing one
amino acid residue side chain for another side chain or equivalent
functionality, then mutagenesis promises to supplant such methods.

Predetermined, site-directed mutagenesis of tRNA synthetase in
which a cys residue is converted to serine has been reported
(G. Winter et al., 1982, "Nature" 299:756-758; A. Wilkinson et al.,
1984, "Nature" 307:187-188). This method is not practical for large
scale mutagenesis. It is an object herein to provide a convenient 
and rapid method for mutating DNA by saturation mutagenesis. 



A method for producing procaryotic carbonyl hydrolase such as 
subtilisin and neutral protease in recombinant host cells is
described in which expression vectors containing sequences which
encode desired subtilisin or neutral protease, including the pro,
pre, or prepro forms of these enzymes, are used to transform hosts,
the host cultured and desired enzymes recovered. By
mutagenesis, the coding sequence can
contain modifications which confer desirable properties on the
protein that is produced, as is further described below.

The novel strains then are transformed with at least one DNA 
moiety encoding a polypeptide not otherwise expressed in the host 
strain, the transformed strains cultured and the polypeptide 
recovered from the culture. Ordinarily, the DNA moiety is a 
directed mutant of a host Bacillus gene, although it may be DNA
encoding a eucaryotic (yeast or mammalian) protein. The novel
strains also serve as hosts for protein expressed from a bacterial
gene derived from sources other than the host genome, or for vectors
expressing these heterologous genes, or homologous genes from the
host genome. In the latter event enzymes such as amylase are
obtained free of neutral protease or subtilisin. In addition, it is
now possible to obtain neutral protease in culture which is free of
enzymatically active subtilisin, and vice-versa. Mutated Bacillus
species incapable of excreting subtilisin or neutral
protease form the subject of copending divisional
application No. 87

The parent application EP 130756-A hereof is directed 
to procaryotic carbonyl hydrolases and mutants thereof, and 
their production in transformed hosts. 

The present invention provides a convenient method for 
saturation mutagenesis, thereby enabling the rapid and 
efficient generation of a plurality of mutations at any one 
site within the coding region of a protein, comprising: 

(a) obtaining a DNA moiety encoding at least a portion 
of said precursor protein; 

(b) identifying a region within the moiety;

(c) substituting nucleotides for those already existing
within the region in order to create at least one
restriction enzyme site unique to the moiety, whereby unique
restriction sites 5' and 3' to the identified region are
made available such that neither alters the amino acids
coded for by the region as expressed;

(d) synthesizing a plurality of oligonucleotides, the 5' and
3' ends of which each contain sequences capable of annealing
to the restriction enzyme sites introduced in step (c) and
which, when ligated to the moiety, are expressed as
substitutions, deletions and/or insertions of at least one
amino acid in or into said precursor protein;

(e) digesting the moiety of step (c) with restriction
enzymes capable of cleaving the unique sites; and

(f) ligating each of the oligonucleotides of step (d) into
the digested moiety of step (e) whereby a plurality of
mutant DNA moieties are obtained.
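
Steps (e) and (f) can be pictured in silico as swapping the segment bounded by the two unique sites for a synthetic cassette. The following is a minimal sketch in Python under stated assumptions, not the laboratory procedure: DNA is treated as a plain string, overhang chemistry is ignored, and the cassette is assumed to carry both recognition sequences at its ends. The function name `replace_between_sites` and the example sequences are hypothetical.

```python
def replace_between_sites(moiety: str, site5: str, site3: str, cassette: str) -> str:
    """Swap the span from site5 through site3 for a cassette (simplified steps e-f)."""
    for site in (site5, site3):
        # Step (c) requires each restriction site to be unique to the moiety.
        if moiety.count(site) != 1:
            raise ValueError(f"site {site} must occur exactly once")
    start = moiety.index(site5)
    end = moiety.index(site3) + len(site3)
    if start + len(site5) > moiety.index(site3):
        raise ValueError("the 5' site must lie wholly upstream of the 3' site")
    if not (cassette.startswith(site5) and cassette.endswith(site3)):
        raise ValueError("cassette ends must match the flanking sites")
    return moiety[:start] + cassette + moiety[end:]


# Hypothetical moiety: a KpnI site (GGTACC) and a PstI site (CTGCAG) flank the region.
moiety = "AAAGGTACCTTTATGCCCCTGCAGAAA"
cassette = "GGTACCTTTGCTCCCCTGCAG"  # ATG replaced by GCT within the region
mutant = replace_between_sites(moiety, "GGTACC", "CTGCAG", cassette)
print(mutant)
```

Repeating the call once per cassette in a pool yields the plurality of mutant moieties referred to in step (f).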

By the foregoing method a mutation can
be introduced into isolated DNA encoding a procaryotic carbonyl
hydrolase which, upon expression of the DNA, results in the
substitution, deletion or insertion of at least one amino acid at a
predetermined site in the hydrolase. This method is useful in
creating mutants of wild type proteins (where the "precursor"
protein is the wild type) or reverting mutants to the wild type
(where the "precursor" is the mutant).

Mutant enzymes are recovered which exhibit oxidative stability 
and/or pH-activity profiles which differ from the precursor
enzymes. Procaryotic carbonyl hydrolases having varied Km, Kcat,
Kcat/Km ratio and substrate specificity also are provided herein.
The mutant enzymes obtained by the methods herein are combined
in known fashion with surfactants or detergents to produce novel 
compositions useful in the laundry or other cleaning arts. 


Brief Description of the Drawings 

Figure 1 shows the sequence of a functional B. amyloliquefaciens
subtilisin gene.

In Figure 1A, the entire functional sequence for B.
amyloliquefaciens, including the promoter and ribosome binding site,
is present on a 1.5 kb fragment of the B. amyloliquefaciens genome.

Figure 1B shows the nucleotide sequence of the coding strand,
correlated with the amino acid sequence of the protein. Promoter
(p), ribosome binding site (rbs) and termination (term) regions of
the DNA sequence are also shown.

Figure 2 shows the results of replica nitrocellulose filters of
purified positive clones probed with Pool 1 (Panel A) and Pool 2
(Panel B) respectively.

Figure 3 shows the restriction analysis of the subtilisin
expression plasmid (pS4). pBS42 vector sequences (4.5 kb) are shown
in solid while the insert sequence (4.4 kb) is shown dashed.

Figure 4 shows the results of SDS-PAGE performed on supernatants
from cultures transformed with pBS42 and pS4.

Figure 5 shows the construction of the shuttle vector pBS42.

Figure 6 shows a restriction map for a sequence including the B.
subtilis subtilisin gene.

Figure 7 is the sequence of a functional B. subtilis subtilisin
gene.

Figure 8 demonstrates a construction method for obtaining a
deletion mutant of a B. subtilis subtilisin gene.

Figure 9 discloses the restriction map for a B. subtilis neutral
protease gene.

Figure 10 is the nucleotide sequence for a B. subtilis neutral
protease gene.

Figure 11 demonstrates the construction of a vector containing a
B. subtilis neutral protease gene.

Figures 12, 13 and 16 disclose embodiments of the mutagenesis
technique provided herein.

Figure 14 shows the enhanced oxidation stability of a subtilisin
mutant.

Figure 15 demonstrates a change in the pH-activity profile of a
subtilisin mutant when compared to the wild type enzyme.




Detailed Description 

Procaryotic carbonyl hydrolases are enzymes which hydrolyze
compounds containing C(=O)-X bonds in which X is oxygen or nitrogen.
They principally include hydrolases, e.g. lipases, and peptide
hydrolases, e.g. subtilisins or metalloproteases. Peptide
hydrolases include α-aminoacylpeptide hydrolase, peptidylamino-acid
hydrolase, acylamino hydrolase, serine carboxypeptidase,
metallocarboxypeptidase, thiol proteinase, carboxylproteinase and
metalloproteinase. Serine, metallo, thiol and acid proteases are
included, as well as endo and exo-proteases.

Subtilisins are serine proteinases which generally act to cleave 
internal peptide bonds of proteins or peptides. Metalloproteases
are exo- or endoproteases which require a metal ion cofactor for
activity.

A number of naturally occurring mutants of subtilisin or neutral
protease exist, and all may be employed with equal effect herein as
sources for starting genetic material.

These enzymes and their genes may be obtained from many 
procaryotic organisms. Suitable examples include gram negative
organisms such as E. coli or Pseudomonas and gram positive bacteria
such as Micrococcus or Bacillus.

The genes encoding the carbonyl hydrolase may be obtained in
accord with the general method herein. As will be seen from the
examples, this comprises synthesizing labelled probes having
putative sequences encoding regions of the hydrolase of interest,
preparing genomic libraries from organisms expressing the
hydrolase, and screening the libraries for the gene of interest by
hybridization to the probes. Positively hybridizing clones are then
mapped and sequenced. The cloned genes are ligated into an
expression vector (which also may be the cloning vector) with
requisite regions for replication in the host, the plasmid
transfected into a host for enzyme synthesis and the recombinant
host cells cultured under conditions favoring enzyme synthesis,
usually selection pressure such as is supplied by the presence of an
antibiotic, the resistance to which is encoded by the vector.
Culture under these conditions results in enzyme yields multifold
greater than the wild type enzyme synthesis of the parent organism,
even if it is the parent organism that is transformed.

"Expression vector" refers to a DNA construct containing a DNA
sequence which is operably linked to a suitable control sequence
capable of effecting the expression of said DNA in a suitable host.
Such control sequences include a promoter to effect transcription,
an optional operator sequence to control such transcription, a
sequence encoding suitable mRNA ribosome binding sites, and
sequences which control termination of transcription and
translation. The vector may be a plasmid, a phage particle, or
simply a potential genomic insert. Once transformed into a suitable
host, the vector may replicate and function independently of the
host genome, or may, in some instances, integrate into the genome
itself. In the present specification, "plasmid" and "vector" are
sometimes used interchangeably as the plasmid is the most commonly
used form of vector at present. However, the invention
is intended to include such other forms of expression vectors which
serve equivalent functions and which are, or become, known in the
art.

"Recombinant host cells" refers to cells which have been 
transformed or transfected with vectors constructed using recombinant
DNA techniques. As relevant to the present invention, recombinant
host cells are those which produce procaryotic carbonyl hydrolases
in their various forms by virtue of having been transformed with
expression vectors encoding these proteins. The recombinant host
cells may or may not have produced a form of carbonyl hydrolase
prior to transformation.

"Operably linked" when describing the relationship between two 
DNA regions simply means that they are functionally related to each
other. For example, a presequence is operably linked to a peptide
if it functions as a signal sequence, participating in the secretion
of the mature form of the protein most probably involving cleavage
of the signal sequence. A promoter is operably linked to a coding
sequence if it controls the transcription of the sequence; a
ribosome binding site is operably linked to a coding sequence if it
is positioned so as to permit translation.

"Prohydrolase" refers to a hydrolase which contains additional
N-terminal amino acid residues which render the enzyme inactive but,
when removed, yield an enzyme. Many proteolytic enzymes are found
in nature as translational proenzyme products and, in the absence of
post-translational processing, are expressed in this fashion.

"Presequence" refers to a signal sequence of amino acids bound 
to the N-terminal portion of the hydrolase which may participate in
the secretion of the hydrolase. Presequences also may be modified
in the same fashion as is described here, including the introduction
of predetermined mutations. When bound to a hydrolase, the subject
protein becomes a "prehydrolase". Accordingly, relevant
prehydrolases for the purposes herein are presubtilisin and
preprosubtilisin. Prehydrolases are produced by deleting the "pro"
sequence (or at least that portion of the pro sequence that
maintains the enzyme in its inactive state) from a prepro coding
region, and then expressing the prehydrolase. In this way the
organism excretes the active enzyme rather than the proenzyme.

The cloned carbonyl hydrolase is used to transform a host cell 
in order to express the hydrolase. This will be of interest where 
the hydrolase has commercial use in its unmodified form, as for 
example subtilisin in laundry products as noted above. In the 
preferred embodiment the hydrolase gene is ligated into a high copy 
number plasmid. This plasmid replicates in hosts in the sense that 
it contains the well-known elements necessary for plasmid 
replication: a promoter operably linked to the gene in question 
(which may be supplied as the gene's own homologous promoter if it 
is recognized, i.e., transcribed, by the host), a transcription 
termination and polyadenylation region (necessary for stability of
the mRNA transcribed by the host from the hydrolase gene) which is
exogenous or is supplied by the endogenous terminator region of the
hydrolase gene and, desirably, a selection gene such as an
antibiotic resistance gene that enables continuous cultural
maintenance of plasmid-infected host cells by growth in
antibiotic-containing media. High copy number plasmids also contain
an origin of replication for the host, thereby enabling large
numbers of plasmids to be generated in the cytoplasm without
chromosomal limitations. However, it is within the scope herein to
integrate multiple copies of the hydrolase gene into the host genome.
This is facilitated by bacterial strains which are particularly
susceptible to homologous recombination. The resulting host cells
are termed recombinant host cells.

Once the carbonyl hydrolase gene has been cloned, a number of
modifications are undertaken to enhance the use of the gene beyond
synthesis of the wild type or precursor enzyme. A precursor enzyme
is the enzyme prior to its modification as described in this
application. Usually the precursor is the enzyme as expressed by
the organism which donated the DNA modified in accord herewith. The
term "precursor" is to be understood as not implying that the
product enzyme was the result of manipulation of the precursor
enzyme per se.

In the first of these modifications, the gene may be deleted
from a recombination positive (rec+) organism containing a
homologous gene. This is accomplished by recombination of an in
vitro deletion mutation of the cloned gene with the genome of the
organism. Many strains of organisms such as E. coli and Bacillus are
known to be capable of recombination. All that is needed is for
regions of the residual DNA from the deletion mutant to recombine
with homologous regions of the candidate host. The deletion may be
within the coding region (leaving enzymatically inactive
polypeptides) or include the entire coding region as long as
homologous flanking regions (such as promoters or termination
regions) exist in the host. Acceptability of the host for
recombination deletion mutants is simply determined by screening for
the deletion of the transformed phenotype. This is most readily
accomplished in the case of carbonyl hydrolase by assaying host
cultures for loss of the ability to cleave a chromogenic substrate
otherwise hydrolyzed by the hydrolase.

Transformed hosts containing the protease deletion mutants are
useful for synthesis of products which are incompatible with
proteolytic enzymes. These hosts by definition are incapable of
excreting the deleted proteases described herein, yet are
substantially normally sporulating. Also the other growth
characteristics of the transformants are substantially like the
parental organism. Such organisms are useful in that it is expected
they will exhibit comparatively less inactivation of heterologous
proteins than the parents, and these hosts do have growth
characteristics superior to known protease-deficient organisms.
However, the deletion of neutral protease and subtilisin as
described in this application does not remove all of the proteolytic
activity of Bacillus. It is believed that intracellular proteases
which are not ordinarily excreted extracellularly "leak" or diffuse
from the cells during late phases of the culture. These
intracellular proteases may or may not be subtilisin or neutral
protease as those enzymes are defined herein. Accordingly, the
novel Bacillus strains herein are incapable of excreting the
subtilisin and/or neutral protease enzymes which ordinarily are
excreted extracellularly in the parent strains. "Incapable" means
not revertible to the wild type. Reversion is a finite probability
that exists with the heretofore known protease-deficient, naturally
occurring strains since there is no assurance that the phenotype of
such strains is not a function of a readily revertible mutation,
e.g. a point mutation. This is to be contrasted with the extremely
large deletions provided herein.

The deletion mutant-transformed host cells herein are free of 
genes encoding enzymatically active neutral protease or subtilisin,
which genes are defined as those being substantially homologous with
the genes set forth in Figs. 1, 7 or 10. "Homologous" genes contain
coding regions capable of hybridizing under high stringency 
conditions with the genes shown in Figs. 1, 7 or 10. 

The microbial strains containing carbonyl hydrolase deletion
mutants are useful in two principal processes. In one embodiment
they are advantageous in the fermentative production of products
ordinarily expressed by a host that are desirably uncontaminated
with the protein encoded by the deletion gene. An example is
fermentative synthesis of amylase, where contaminant proteases
interfere in many industrial uses for amylase. The novel strains
herein relieve the art from part of the burden of purifying such 
products free of contaminating carbonyl hydrolases. 

In a second principal embodiment, subtilisin and neutral
protease deletion-mutant strains are useful in the synthesis of
protein which is not otherwise encoded by the strain. These
proteins will fall within one of two classes. The first class
consists of proteins encoded by genes exhibiting no substantial
pretransformation homology with those of the host. These may be
proteins from other procaryotes but ordinarily are eucaryotic
proteins from yeast or higher eucaryotic organisms, particularly
mammals. The novel strains herein serve as useful hosts for
expressible vectors containing genes encoding such proteins because
the probability for proteolytic degradation of the expressed,
non-homologous proteins is reduced.

The second group consists of mutant host genes exhibiting
substantial pretransformation homology with those of the host.
These include mutations of procaryotic carbonyl hydrolases such as
subtilisin and neutral protease, as well as microbial rennin (for
example, rennin from the genus Mucor). These mutants are selected in
order to improve the characteristics of the precursor enzyme for
industrial uses.

A novel method is provided to facilitate the construction and
identification of such mutants. First, the gene encoding the
hydrolase is obtained and sequenced in whole or in part. Then the
sequence is scanned for a point at which it is desired to make a
mutation (deletion, insertion or substitution) of one or more amino
acids in the expressed enzyme. The sequences flanking this point
are evaluated for the presence of restriction sites for replacing a
short segment of the gene with an oligonucleotide pool which when
expressed will encode various mutants. Since unique restriction
sites are generally not present at locations within a convenient
distance from the selected point (from 10 to 15 nucleotides), such
sites are generated by substituting nucleotides in the gene in such
a fashion that neither the reading frame nor the amino acids encoded
are changed in the final construction. The task of locating
suitable flanking regions and evaluating the needed changes to
arrive at two unique restriction site sequences is made routine by
the redundancy of the genetic code, a restriction enzyme map of the
gene and the large number of different restriction enzymes. Note
that if a fortuitous flanking unique restriction site is available,
the above method need be used only in connection with the flanking
region which does not contain a site.
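
The search for a silent, unique site described above lends itself to a small program. The sketch below is one possible illustration in Python, not the search program used by the applicants: it writes a recognition sequence over every offset of an in-frame coding region and keeps the offsets where the translation is unchanged and the site occurs only once. The names `silent_site_positions`, `translate` and `count_occurrences`, and the 12-base example region, are assumptions made for illustration; the standard genetic code is assumed.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, with codons enumerated in TCAG order.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence whose length is a multiple of 3."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def count_occurrences(seq: str, site: str) -> int:
    """Count occurrences of site in seq, including overlapping ones."""
    return sum(seq.startswith(site, i) for i in range(len(seq) - len(site) + 1))


def silent_site_positions(cds: str, site: str) -> list[tuple[int, int]]:
    """Return (offset, bases_changed) pairs where writing `site` into `cds`
    leaves the encoded protein unchanged and the site unique."""
    protein = translate(cds)
    hits = []
    for i in range(len(cds) - len(site) + 1):
        mutated = cds[:i] + site + cds[i + len(site):]
        changed = sum(a != b for a, b in zip(cds, mutated))
        if (changed and translate(mutated) == protein
                and count_occurrences(mutated, site) == 1):
            hits.append((i, changed))
    return hits


# Hypothetical region encoding Ala-Gly-Thr-Val, searched for a KpnI site (GGTACC).
print(silent_site_positions("GCTGGTACAGTT", "GGTACC"))
```

In practice the uniqueness check would be run against the whole moiety rather than the short region, and rarely used codons could be filtered out of the hits.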

Mutation of the gene in order to change its sequence to conform
to the desired sequence is accomplished by M13 primer extension in
accord with generally known methods. Once the gene is cloned, it is
digested with the unique restriction enzymes and a plurality of
end-termini-complementary oligonucleotide cassettes are ligated into
the unique sites. The mutagenesis is enormously simplified by this
method because all of the oligonucleotides can be synthesized so as
to have the same restriction sites, and no synthetic linkers are
necessary to create the restriction sites.
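
Because every cassette shares the same termini, a pool for one target codon differs only at that codon. The following Python sketch shows how such a pool might be enumerated; it is an illustration under assumptions, not the applicants' oligonucleotide design. The flanking sequences are hypothetical, the table `CODON_FOR` is an arbitrary choice of one codon per amino acid, and both strands are written full length, so restriction overhangs are not modelled.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# One arbitrarily chosen codon per amino acid (an assumption for illustration).
CODON_FOR = {
    "A": "GCT", "C": "TGC", "D": "GAT", "E": "GAA", "F": "TTC",
    "G": "GGT", "H": "CAT", "I": "ATC", "K": "AAA", "L": "CTG",
    "M": "ATG", "N": "AAC", "P": "CCG", "Q": "CAG", "R": "CGT",
    "S": "TCT", "T": "ACT", "V": "GTT", "W": "TGG", "Y": "TAC",
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def cassette_pool(left: str, right: str, wild_type: str) -> dict[str, tuple[str, str]]:
    """Map each substituted amino acid to its (top strand, bottom strand) cassette."""
    pool = {}
    for aa, codon in CODON_FOR.items():
        if aa == wild_type:
            continue  # the wild-type residue is not a substitution
        top = left + codon + right
        pool[aa] = (top, reverse_complement(top))
    return pool


# Hypothetical flanks around a target codon whose wild-type residue is Met.
pool = cassette_pool("AAGTC", "TTCGA", wild_type="M")
print(len(pool), pool["A"])
```

Deletions or insertions would be produced in the same way by omitting the codon or by placing more than one codon between the flanks.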

The number of commercially available restriction enzymes having
sites not present in the gene of interest is generally large. A
suitable DNA sequence computer search program simplifies the task of
finding potential 5' and 3' unique flanking sites. A primary
constraint is that any mutation introduced in creation of the
restriction site must be silent to the final constructed amino acid
coding sequence. For a candidate restriction site 5' to the target
codon a sequence must exist in the gene which contains at least all
the nucleotides but for one in the recognition sequence 5' to the
cut of the candidate enzyme. For example, the blunt cutting enzyme
SmaI (CCC/GGG) would be a 5' candidate if a nearby 5' sequence
contained NCC, CNC, or CCN. Furthermore, if N needed to be altered
to C this alteration must leave the amino acid coding sequence
intact. In cases where a permanent silent mutation is necessary to
introduce a restriction site one may want to avoid the introduction
of a rarely used codon. A similar situation for SmaI would apply
for 3' flanking sites except the sequence NGG, GNG, or GGN must
exist. The criteria for locating candidate enzymes are most relaxed
for blunt cutting enzymes and most stringent for 4 base overhang
enzymes. In general many candidate sites are available. For the
codon-222 target described herein a BalI site (TGG/CCA) could have
been engineered in one base pair 5' from the KpnI site. A 3' EcoRV
site (GAT/ATC) could have been employed 11 base pairs 5' to the PstI
site. A cassette having termini ranging from a blunt end up to a
four base overhang will function without difficulty. In retrospect,
this hypothetical EcoRV site would have significantly shortened the
oligonucleotide cassette employed (9 and 13 base pairs) thus
allowing greater purity and lower pool bias problems. Flanking
sites should obviously be chosen which cannot themselves ligate so
that ligation of the oligonucleotide cassette can be assured in a
single orientation.
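
The search for candidate flanking sites described above can be
sketched in a few lines of Python. This is an illustrative sketch
only, not the computer search program referred to in the text. The
function names, the example sequence and the one-substitution limit
are assumptions made for illustration. The sketch scans an in-frame
coding sequence for windows that differ from a recognition half-site
(for example CCC for the 5' side of SmaI) by a single nucleotide, and
keeps those where writing in the half-site leaves the encoded amino
acids unchanged.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame coding sequence, ignoring a trailing partial codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def silent_half_site_candidates(cds, half_site, max_changes=1):
    """Return (position, changes) pairs where `half_site` can be written into
    `cds` with at most `max_changes` substitutions and no amino acid change."""
    protein = translate(cds)
    hits = []
    for pos in range(len(cds) - len(half_site) + 1):
        window = cds[pos:pos + len(half_site)]
        changes = sum(a != b for a, b in zip(window, half_site))
        if 0 < changes <= max_changes:
            mutated = cds[:pos] + half_site + cds[pos + len(half_site):]
            if translate(mutated) == protein:
                hits.append((pos, changes))
    return hits


# Hypothetical in-frame stretch encoding Ala-Pro-Gly (not from the Figures).
example = "GCTCCAGGT"
print(silent_half_site_candidates(example, "CCC"))
```

The sketch does not check codon usage, uniqueness of the resulting
site within the gene, or the ability of the two flanking sites to
ligate to each other; those constraints from the text would have to
be applied to the candidates it returns.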

The mutation per se need not be predetermined. For example, an
oligonucleotide cassette or fragment is randomly mutagenized with
nitrosoguanidine or other mutagen and then in turn ligated into the
hydrolase gene at a predetermined location.

The mutant carbonyl hydrolases expressed upon transformation of
the suitable hosts are screened for enzymes exhibiting desired
characteristics, e.g. substrate specificity, oxidation stability,
pH-activity profiles and the like.

A change in substrate specificity is defined as a difference
between the Kcat/Km ratio of the precursor enzyme and that of the
mutant. The Kcat/Km ratio is a measure of catalytic efficiency.
Procaryotic carbonyl hydrolases with increased or diminished Kcat/Km
ratios are described in the examples. Generally, the objective will
be to secure a mutant having a greater (numerically larger) Kcat/Km
ratio for a given substrate, thereby enabling the use of the enzyme
to more efficiently act on a target substrate. An increase in
Kcat/Km ratio for one substrate may be accompanied by a reduction
in Kcat/Km ratio for another substrate. This is a shift in
substrate specificity, and mutants exhibiting such shifts have
utility where the precursors are undesirable, e.g. to prevent
undesired hydrolysis of a particular substrate in an admixture of
substrates.
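
As an illustration of the definition above, the following Python
sketch compares Kcat/Km ratios of a precursor and a mutant across two
substrates. The function names and all numerical values are
hypothetical and are not measurements from the Examples. A fold
change above 1 for one substrate together with a fold change below 1
for another corresponds to the shift in substrate specificity
described in the text.

```python
def catalytic_efficiency(kcat, km):
    """Return kcat/Km; kcat in 1/s and Km in M gives units of 1/(M*s)."""
    return kcat / km


def specificity_shift(precursor, mutant):
    """Map each substrate to the mutant/precursor ratio of kcat/Km values."""
    return {
        substrate: catalytic_efficiency(*mutant[substrate])
        / catalytic_efficiency(*precursor[substrate])
        for substrate in precursor
    }


# Hypothetical (kcat, Km) pairs; not measurements from the Examples.
precursor = {"substrate_A": (50.0, 1.0e-4), "substrate_B": (20.0, 2.0e-4)}
mutant = {"substrate_A": (100.0, 1.0e-4), "substrate_B": (5.0, 2.0e-4)}

for name, fold in specificity_shift(precursor, mutant).items():
    print(f"{name}: {fold:.2f}-fold change in kcat/Km")
```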

Kcat and Km are measured in accord with known procedures, or as
described in Example 18.

Oxidation stability is a further objective which is accomplished
by mutants described in the examples. The stability may be enhanced
or diminished as is desired for various uses. Enhanced stability is
effected by deleting one or more methionine, tryptophan, cysteine or
lysine residues and, optionally, substituting another amino acid
residue not one of methionine, tryptophan, cysteine or lysine. The
opposite substitutions result in diminished oxidation stability.
The substituted residue is preferably alanyl, but neutral residues
also are suitable.

Mutants are provided which exhibit modified pH-activity
profiles. A pH-activity profile is a plot of pH against enzyme
activity and may be constructed as illustrated in Example 19 or by
methods known in the art. It may be desired to obtain mutants with
broader profiles, i.e., those having greater activity at certain pH
than the precursor, but no significantly greater activity at any pH,
or mutants with sharper profiles, i.e., those having enhanced
activity when compared to the precursor at a given pH and lesser
activity elsewhere.

The foregoing mutants preferably are made within the active site
of the enzyme as these mutations are most likely to influence
activity. However, mutants at other sites important for enzyme
stability or conformation are useful. In the case of Bacillus
subtilisin or its pre, prepro and pro forms, mutations at tyrosine-1,
aspartate+32, asparagine+155, tyrosine+104, methionine+222,
glycine+166, histidine+64, glycine+169, phenylalanine+189, serine+33,
serine+221, tyrosine+217, glutamate+156 and/or alanine+152 produce
mutants having changes in the characteristics described above or in
the processing of the enzyme. Note that these amino acid position
numbers are those assigned to B. amyloliquefaciens subtilisin as
seen from Fig. 7. It should be understood that a deletion or
insertion in the N-terminal direction from a given position will
shift the relative amino acid positions so that a residue will not
occupy its original or wild type numerical position. Also, allelic
differences and the variation among various procaryotic species will
result in position shifts, so that position 169 in such subtilisins
will not be occupied by glycine. In such cases the new positions
for glycine will be considered equivalent to and embraced within the
designation glycine+169. The new position for glycine+169 is
readily identified by scanning the subtilisin in question for a
region homologous to glycine+169 in Fig. 7.
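
The identification of an equivalent position can be illustrated with
a short Python sketch. It assumes that a pairwise alignment of the
reference subtilisin and the subtilisin in question has already been
produced by some other means, with gaps written as '-'. The function
name and the toy sequences are hypothetical and are not taken from
Fig. 7. The sketch simply counts residues along both rows of the
alignment to translate a reference position number into the
corresponding position in the other sequence.

```python
def equivalent_position(aligned_ref, aligned_query, ref_position):
    """Map a 1-based residue number in the reference to the query.

    Both inputs are rows of one pairwise alignment with '-' for gaps.
    Returns None if the reference residue is aligned to a gap.
    """
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    ref_count = query_count = 0
    for ref_char, query_char in zip(aligned_ref, aligned_query):
        if ref_char != "-":
            ref_count += 1
        if query_char != "-":
            query_count += 1
        if ref_char != "-" and ref_count == ref_position:
            return query_count if query_char != "-" else None
    raise ValueError("ref_position lies beyond the reference sequence")


# Toy alignment (not real subtilisin sequence): the query lacks one
# residue N-terminal to the glycine of interest.
ref = "AQSVGYK"
query = "AQ-VGYK"
print(equivalent_position(ref, query, 5))
```

In this toy case the glycine at reference position 5 is found one
position earlier in the query, which mirrors the shift caused by an
N-terminal deletion as described above.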

One or more, ordinarily up to about 10, amino acid residues may 
be mutated. However, there is no limit to the number of mutations 
that are to be made aside from commercial practicality. 

The enzymes herein may be obtained as salts. It is clear that
the ionization state of a protein will be dependent on the pH of the
surrounding medium, if it is in solution, or of the solution from
which it is prepared, if it is in solid form. Acidic proteins are
commonly prepared as, for example, the ammonium, sodium, or
potassium salts; basic proteins as the chlorides, sulfates, or
phosphates. Accordingly, the present application includes both
electrically neutral and salt forms of the designated carbonyl
hydrolases, and the term carbonyl hydrolase refers to the organic
structural backbone regardless of ionization state.

The mutants are particularly useful in the food processing and
cleaning arts. The carbonyl hydrolases, including mutants, are
produced by fermentation as described herein and recovered by
suitable techniques. See for example K. Aunstrup, 1974, Industrial
Aspects of Biochemistry, ed. B. Spencer pp. 23-46. They are
formulated with detergents or other surfactants in accord with
methods known per se for use in industrial processes, especially
laundry. In the latter case the enzymes are combined with
detergents, builders, bleach and/or fluorescent whitening agents as
is known in the art for proteolytic enzymes. Suitable detergents
include linear alkyl benzene sulfonates, alkyl ethoxylated sulfate,
sulfated linear alcohol or ethoxylated linear alcohol. The
compositions may be formulated in granular or liquid form. See for
example U.S. Patents 3,623,957; 4,404,128; 4,381,247; 4,404,115;
4,318,818; 4,261,868; 4,242,219; 4,142,999; 4,111,855; 4,011,169;
4,090,973; 3,985,686; 3,790,482; 3,749,671; 3,560,392; 3,558,498;
and 3,557,002.

The following disclosure is intended to serve as a
representation of embodiments herein, and should not be construed as
limiting the scope of this application.



Glossary of Experimental Manipulations

In order to simplify the Examples certain frequently occurring 
methods will be referenced by shorthand phrases. 



Plasmids are designated by a small p preceded and/or followed
by capital letters and/or numbers. The starting plasmids herein are
commercially available, are available on an unrestricted basis, or
can be constructed from such available plasmids in accord with
published procedures.

"Klenow treatment" refers to the process of filling a recessed
3' end of double stranded DNA with deoxyribonucleotides
complementary to the nucleotides making up the protruding 5' end of
the DNA strand. This process is usually used to fill in a recessed
end resulting from a restriction enzyme cleavage of DNA. This
creates a blunt or flush end, as may be required for further
ligations. Treatment with Klenow is accomplished by reacting
(generally for 15 minutes at 15°C) the appropriate complementary
deoxyribonucleotides with the DNA to be filled in under the
catalytic activity (usually 10 units) of the Klenow fragment of
E. coli DNA polymerase I ("Klenow"). Klenow and the other reagents
needed are commercially available. The procedure has been published
extensively. See for example T. Maniatis et al., 1982, Molecular
Cloning, pp. 107-108.

"Digestion" of DNA refers to catalytic cleavage of the DNA with
an enzyme that acts only at certain locations in the DNA. Such
enzymes are called restriction enzymes, and the site for which each
is specific is called a restriction site. "Partial" digestion
refers to incomplete digestion by a restriction enzyme, i.e.,
conditions are chosen that result in cleavage of some but not all of
the sites for a given restriction endonuclease in a DNA substrate.
The various restriction enzymes used herein are commercially
available and their reaction conditions, cofactors and other
requirements as established by the enzyme suppliers were used.
Restriction enzymes commonly are designated by abbreviations
composed of a capital letter followed by other letters and then,
generally, a number representing the microorganism from which each
restriction enzyme originally was obtained. In general, about 1 µg
of plasmid or DNA fragment is used with about 1 unit of enzyme in
about 20 µl of buffer solution. Appropriate buffers and substrate
amounts for particular restriction enzymes are specified by the
manufacturer. Incubation times of about 1 hour at 37°C are
ordinarily used, but may vary in accordance with the supplier's
instructions. After incubation, protein is removed by extraction
with phenol and chloroform, and the digested nucleic acid is
recovered from the aqueous fraction by precipitation with ethanol.
Digestion with a restriction enzyme infrequently is followed with
bacterial alkaline phosphatase hydrolysis of the terminal 5'
phosphates to prevent the two restriction cleaved ends of a DNA
fragment from "circularizing" or forming a closed loop that would
impede insertion of another DNA fragment at the restriction site.
Unless otherwise stated, digestion of plasmids is not followed by 5'
terminal dephosphorylation. Procedures and reagents for
dephosphorylation are conventional (T. Maniatis et al., Id.,
pp. 133-134).
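
The difference between complete and partial digestion can be
illustrated in silico. The following Python sketch is a simplified
model under stated assumptions: the molecule is linear, only the top
strand is considered, and the function names and the 30 bp example
sequence are hypothetical. It lists the fragment lengths expected
when every site is cut, and the set of lengths that can appear when
only some of the sites are cut.

```python
from itertools import combinations


def cut_positions(dna, site, cut_offset):
    """Return 0-based cut coordinates on the top strand of a linear molecule.

    `cut_offset` is the number of bases of `site` that lie 5' to the cut.
    """
    positions = []
    start = dna.find(site)
    while start != -1:
        positions.append(start + cut_offset)
        start = dna.find(site, start + 1)
    return positions


def complete_digest(dna, site, cut_offset):
    """Return fragment lengths after cleavage at every occurrence of `site`."""
    bounds = [0] + cut_positions(dna, site, cut_offset) + [len(dna)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


def partial_digest_lengths(dna, site, cut_offset):
    """Return every fragment length possible when any subset of sites is cut."""
    bounds = [0] + cut_positions(dna, site, cut_offset) + [len(dna)]
    return sorted({b - a for a, b in combinations(bounds, 2)})


# Hypothetical 30 bp linear sequence with two EcoRI sites (G/AATTC).
dna = "AAAA" + "GAATTC" + "C" * 8 + "GAATTC" + "T" * 6
print(complete_digest(dna, "GAATTC", 1))
print(partial_digest_lengths(dna, "GAATTC", 1))
```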

"Recovery" or "Isolation" of a given fragment* of DNA from a 
restriction digest means separation of the digest on 6 percent
polyacrylamide gel electrophoresis, identification of the fragment
of interest by molecular weight (using DNA fragments of known
molecular weight as markers), removal of the gel section containing
the desired fragment, and separation of the gel from DNA. This
procedure is known generally. For example, see R. Lawn et al.,
1981, "Nucleic Acids Res." 9:6103-6114, and D. Goeddel et al.,
1980, "Nucleic Acids Res." 8:4057.

"Southern Analysis" is a method by which the presence of DNA
sequences in a digest or DNA-containing composition is confirmed by
hybridization to a known, labelled oligonucleotide or DNA fragment.
For the purposes herein, Southern analysis shall mean separation of
digests on 1 percent agarose and depurination as described by
G. Wahl et al., 1979, "Proc. Nat. Acad. Sci. U.S.A." 76:3683-3687,
transfer to nitrocellulose by the method of E. Southern, 1975,
"J. Mol. Biol." 98:503-517, and hybridization as described by
T. Maniatis et al., 1978, "Cell" 15:687-701.

"Transformation" means introducing DNA into an organism so that
the DNA is replicable, either as an extrachromosomal element or
chromosomal integrant. Unless otherwise stated, the method used
herein for transformation of E. coli is the CaCl2 method of Mandel
et al., 1970, "J. Mol. Biol." 53:154, and for Bacillus, the method
of Anagnostopolous et al., 1961, "J. Bact." 81:741-746.

"Ligation" refers to the process of forming phosphodiester bonds
between two double stranded nucleic acid fragments (T. Maniatis
et al., Id., p. 146). Unless otherwise stated, ligation was
accomplished using known buffers and conditions with 10 units of T4
DNA ligase ("ligase") per 0.5 µg of approximately equimolar amounts
of the DNA fragments to be ligated. Plasmids from the transformants
were prepared, analyzed by restriction mapping and/or sequenced by
the method of Messing, et al., 1981, "Nucleic Acids Res.", 9:309.

"Preparation" of DNA from transformants means isolating plasmid
DNA from microbial culture. Unless otherwise stated, the
alkaline/SDS method of Maniatis et al., Id., p. 90, was used.

"Oligonucleotides" are short length single or double stranded
polydeoxynucleotides which were chemically synthesized by the method
of Crea et al., 1980, "Nucleic Acids Res." 8:2331-2348 (except that
mesitylene nitrotriazole was used as a condensing agent) and then
purified on polyacrylamide gels.

All literature citations are expressly incorporated by reference.


Example 1

Preparation of a Genomic DNA Library from B. amyloliquefaciens
and Isolation of its Subtilisin Gene

The known amino acid sequence of the extracellular
B. amyloliquefaciens subtilisin permits the construction of a
suitable probe mixture. The sequence of the mature subtilisin is
included (along with the additional information contributed by the
present work) in Figure 1. All codon ambiguity for the sequence of
amino acids at positions 117 through 121 is covered by a pool of
eight oligonucleotides of the sequence AA(C/T)AA(C/T)ATGGA(C/T)GT.
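
The composition of such a probe pool can be enumerated with a short
Python sketch. The template below writes each two-fold ambiguous
position as (C/T); this notation and the function name are
assumptions made for illustration. Three ambiguous positions with
two alternatives each give the pool of eight oligonucleotides
mentioned above.

```python
from itertools import product


def expand_pool(template):
    """Expand a template such as 'AA(C/T)AT' into every member sequence.

    Alternatives at a position are written in parentheses separated by '/'.
    """
    choices = []
    i = 0
    while i < len(template):
        if template[i] == "(":
            close = template.index(")", i)
            choices.append(template[i + 1:close].split("/"))
            i = close + 1
        else:
            choices.append([template[i]])
            i += 1
    return ["".join(combo) for combo in product(*choices)]


pool = expand_pool("AA(C/T)AA(C/T)ATGGA(C/T)GT")
print(len(pool))  # 2 * 2 * 2 members
for oligo in pool:
    print(oligo)
```

Fixing the first ambiguous position, as in
expand_pool("AACAA(C/T)ATGGA(C/T)GT"), yields a sub-pool of four
members.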

Chromosomal DNA isolated from B. amyloliquefaciens (ATCC No.
23844) as described by J. Marmur, "J. Mol. Biol.", 3:208, was
partially digested by Sau3A, and the fragments size selected and
ligated into the BamHI site of dephosphorylated pBS42. (pBS42 is a
shuttle vector containing origins of replication effective both in
E. coli and Bacillus. It is prepared as described in Example 4.)
The Sau3A fragment containing vectors were transformed into E. coli
K12 strain 294 (ATCC No. 31446) according to the method of M.
Mandel, et al., 1970, "J. Mol. Bio." 53:154 using 80-400 nanograms
of library DNA per 250 µl of competent cells.

Cells from the transformation mixture were plated at a
density of 1-5 x transformants per 150mm plate containing LB
medium + 12.5 µg/ml chloramphenicol, and grown overnight at 37°C
until visible colonies appeared. The plates were then replica
plated onto BA85 nitrocellulose filters overlayed on
LB/chloramphenicol plates. The replica plates were grown 10-12
hours at 37°C and the filters transferred to fresh plates
containing LB and 150 µg/ml spectinomycin to amplify the plasmid
pool.

After overnight incubation at 37°C, filters were processed
essentially as described by Grunstein and Hogness, 1975, "Proc.
Natl. Acad. Sci. (USA)" 72:3961. Out of approximately 20,000
successful transformants, 25 positive colonies were found. Eight of
these positives were streaked to purify individual clones. 24
clones from each streak were grown in microtiter wells, stamped on
to two replica filters, and probed as described above with either
AACAA(C/T)ATGGA(C/T)GT (pool 1) or AATAA(C/T)ATGGA(C/T)GT (pool 2),
which differ by only one nucleotide. As shown in Figure 2, pool 1
hybridized to a much greater extent to all positive clones than did
pool 2, suggesting specific hybridization.

Four out of five miniplasmid preparations (Maniatis et al.,
Id.) from positive clones gave identical restriction digest patterns
when digested with Sau3A or HincII. The plasmid isolated from one of
these four identical colonies by the method of Maniatis et al., Id.,
had the entire correct gene sequence and was designated pS4. The
characteristics of this plasmid as determined by restriction analysis
are shown in Figure 3.



Example 2 

Expression of the Subtilisin Gene

Bacillus subtilis I-168 (Catalog No. 1-A1, Bacillus Genetic
Stock Center) was transformed with pS4 and a single chloramphenicol
resistant transformant then grown in minimal medium. After 24 hours,
the culture was centrifuged and both the supernatant (10-200 µl) and
pellet assayed for proteolytic activity by measuring the change in
absorbance per minute at 412 nm using 1 ml of the chromogenic substrate
succinyl-L-ala-ala-pro-phe-p-nitroanilide (0.2 µM) in 0.1M sodium
phosphate (pH 8.0) at 25°C. A B. subtilis I-168 culture transformed
with pBS42 used as a control showed less than 1/200 of the activity
shown by the pS4 transformed culture. Greater than 95 percent of the
protease activity of the pS4 culture was present in the supernatant,
and was completely inhibited by treatment with phenylmethylsulfonyl
fluoride (PMSF) but not by EDTA.

Aliquots of the supernatants were treated with PMSF and EDTA to
inhibit all protease activity and analyzed by 12 percent SDS-PAGE
according to the method of Laemmli, U.K., 1970 "Nature", 227:680. To
prepare the supernatants, 16 µl of supernatant was treated with 1 mM
PMSF, 10 mM EDTA for 10 minutes, and boiled with 4 µl of 5x
concentrated SDS sample buffer minus β-mercaptoethanol. The results
of Coomassie stain on runs using supernatants of cells transformed
with pS4, pBS42, and untransformed B. amyloliquefaciens are shown in
Figure 4. Lane 3 shows authentic subtilisin from B. amyloliquefaciens.
Lane 2, which is the supernatant from pBS42 transformed B. subtilis,
does not give the 31,000 MW band associated with subtilisin which is
exhibited by Lane 1 from pS4 transformed hosts. The approximately
31,000 MW band result for subtilisin is characteristic of the slower
mobility shown by the known M.W. 27,500 subtilisin preparations in
general.

Example 3 

Sequencing of the B. amyloliquefaciens Subtilisin Gene 

The entire sequence of an EcoRI-BamHI fragment (wherein the EcoRI
site was constructed by conversion of the HincII site) of pS4 was
determined by the method of F. Sanger, 1977, "Proc. Natl. Acad. Sci.
(USA)", 74:5463. Referring to the restriction map shown in Figure 3,
the BamHI-PvuII fragment was found to hybridize with pool 1
oligonucleotides by Southern analysis. Data obtained from sequencing
of this fragment directed the sequencing of the remaining fragments
(e.g. PvuII-HincII and AvaI-AvaI). The results are shown in Figure 1.

Examination of the sequence confirms the presence of codons for
the mature subtilisin corresponding to that secreted by the
B. amyloliquefaciens. Immediately upstream from this sequence is a
series of 107 codons beginning with the GTG start codon at -107.
Codon -107 to approximately codon -75 encodes an amino acid sequence
whose characteristics correspond to that of known signal sequences.
(Most such signal sequences are 18-30 amino acids in length, have
hydrophobic cores, and terminate in a small hydrophobic amino acid.)
Accordingly, examination of the sequence data would indicate that
codons -107 to approximately -75 encode the signal sequence; the
remaining intervening codons between -75 and -1 presumably encode a
prosequence.



Example 4 

Construction of pBS42

pBS42 is formed by three-way ligation of fragments derived from
pUB110, pC194, and pBR322 (see Figure 5). The fragment from pUB110 is
the approximately 2600 base pair fragment between the HpaII site at
1900 and the BamHI site at 4500 and contains an origin of replication
operable in Bacillus: T. Gryczan, et al., 1978, "J. Bacteriol.", 134:
318; A. Jalanko, et al., 1981, "Gene", 14:325. The BamHI site
was treated with Klenow. The pBR322 portion is the 1100 base pair
fragment between the PvuII site at 2067 and the Sau3A site at 3223
which contains the E. coli origin of replication: F. Bolivar, et al.,
1977, "Gene", 2:95; J. Sutcliffe, 1978, Cold Spring Harbor Symposium
43: I, 77. The pC194 fragment is the 1200 base pair fragment between
the HpaII site at 973 and the Sau3A site at 2006 which contains the
gene for chloramphenicol resistance expressible in both E. coli and
B. subtilis: S. Ehrlich, "Proc. Natl. Acad. Sci. (USA)", 74:1680;
S. Horinouchi et al., 1982, "J. Bacteriol." 150:815.

pBS42 thus contains origins of replication operable both in
E. coli and in Bacillus and an expressible gene for chloramphenicol
resistance.






Example 5 

Isolation and Sequencing of the B. subtilis Subtilisin Gene 

B. subtilis I-168 chromosomal DNA was digested with EcoRI and the
fragments resolved on gel electrophoresis. A single 6 kb fragment
hybridized to a [α-32P] CTP nick translation-labelled fragment
obtained from the C-terminus of the subtilisin structural gene in pS4,
described above. The 6 kb fragment was electroeluted and ligated into
pBS42 which had been digested with EcoRI and treated with bacterial
alkaline phosphatase. E. coli ATCC 31446 was transformed with the
ligation mixture and transformants selected by growth on LB agar
containing 12.5 µg chloramphenicol/ml. Plasmid DNA was prepared from
a pooled suspension of 5,000 transformed colonies. This DNA was
transformed into B. subtilis BG84, a protease deficient strain, the
preparation of which is described in Example 8 below. Colonies which
produced protease were screened by plating on LB agar plus 1.5 percent
w/w Carnation powdered nonfat skim milk and 5 µg chloramphenicol/ml
(hereafter termed skim milk selection plates) and observing for zones
of clearance evidencing proteolytic activity.

Plasmid DNA was prepared from protease producing colonies,
digested with EcoRI, and examined by Southern analysis for the
presence of the 6 kb EcoRI insert by hybridization to the
32P-labelled C-terminus fragment of the subtilisin structural gene
from B. amyloliquefaciens. A positive clone was identified and the
plasmid was designated pS168.1. B. subtilis BG84 transformed with
pS168.1 excreted serine protease at a level 5-fold over that produced
in B. subtilis I-168. Addition of EDTA to the supernatants did not
affect the assay results, but the addition of PMSF
(phenylmethylsulfonyl fluoride) to the supernatants reduced protease
activity to levels undetectable in the assay described in Example 8
for strain BG84.






A restriction map of the 6.5 kb EcoRI insert is shown in Fig. 6.
The subtilisin gene was localized to within the 2.5 kb KpnI-EcoRI
fragment by subcloning various restriction enzyme digests and testing
for expression of subtilisin in B. subtilis BG84. Southern analysis
with the labelled fragment from the C-terminus of the
B. amyloliquefaciens subtilisin gene as a probe localized the
C-terminus of the B. subtilis gene to within or part of the 631 bp
HincII fragment B in the center of this subclone (see Fig. 6). The
tandem HincII fragments B, C, and D and HincII-EcoRI fragment E
(Fig. 6) were ligated into the M13 vectors mp8 or mp9 and sequenced in
known fashion (J. Messing et al., 1982, "Gene" 19:269-276) using
dideoxy chain termination (F. Sanger et al., 1977, "Proc. Nat. Acad.
Sci. U.S.A." 74:5463-5467). The sequence of this region is shown in
Fig. 7. The first 23 amino acids are believed to be a signal
peptide. The remaining 83 amino acids between the signal sequence and
the mature coding sequence constitute the putative "pro" sequence.
The overlined nucleotides at the 3' end of the gene are believed to be
transcription terminator regions. Two possible Shine-Dalgarno
sequences are underlined upstream from the mature start codon.


Example 6 

Manufacture of an Inactivating Mutation of the B. subtilis
Subtilisin Gene

A two step ligation, shown in Fig. 8, was required to construct a
plasmid carrying a defective gene which would integrate into the
Bacillus chromosome. In the first step, pS168.1, which contained the
6.5 kb insert originally recovered from the B. subtilis genomic
library as described in Example 5 above, was digested with EcoRI, the
reaction products treated with Klenow, the DNA digested with HincII,
and the 800 bp EcoRI-HincII fragment E (see Fig. 6) that contains, in
part, the 5' end of the B. subtilis subtilisin gene, was recovered.
This fragment was ligated into pJH101 (pJH101 is available from
J. Hoch (Scripps) and is described by F.A. Ferrari et al., 1983,
"J. Bact." 134:318-329) that had been digested with HincII and treated
with bacterial alkaline phosphatase. The resultant plasmid, pIDV1,
contained fragment E in the orientation shown in Fig. 8. In the
second step, pS168.1 was digested with HincII and the 700 bp HincII
fragment B, which contains the 3' end of the subtilisin gene, was
recovered. pIDV1 was digested at its unique HincII site and
fragment B ligated to the linearized plasmid, transformed in E. coli
ATCC 31446, and selected on LB plates containing 12.5 µg
chloramphenicol/ml or 20 µg ampicillin/ml. One resulting plasmid,
designated pIDV1.4, contained fragment B in the correct orientation
with respect to fragment E. This plasmid pIDV1.4, shown in Fig. 8, is
a deletion derivative of the subtilisin gene containing portions of
the 5' and 3' flanking sequences as well.


B. subtilis BG77, a partial protease-deficient mutant (Prt+/-)
prepared in Example 8 below was transformed with pIDV1.4. Two classes
of chloramphenicol resistant (Cmr) transformants were obtained.
Seventy-five percent showed the same level of proteases as BG77
(Prt+/-) and 25 percent were almost completely protease deficient
(Prt-) as observed by relative zones of clearing on plates containing
LB agar plus skim milk. The Cmr Prt- transformants could not be
due to a single crossover integration of the plasmid at the homologous
regions for fragment E or B because, in such a case, the gene would be
uninterrupted and the phenotype would be Prt+/-. In fact, when
either of fragments E or B was ligated independently into pJH101 and
subsequently transformed into B. subtilis BG77, the protease deficient
phenotype was not observed. The Cmr phenotype of Cmr Prt-
pIDV1.4 transformants was unstable in that Cms Prt- derivatives
could be isolated from Cmr Prt- cultures at a frequency of about
0.1 percent after 10 generations of growth in minimal medium in the
absence of antibiotic selection. One such derivative was obtained and
designated BG2018. The deletion was transferred into 1A84 (a BGSC
strain carrying two auxotrophic mutations flanking the subtilisin gene)
by PBS1 transduction. The derivative organism was designated BG2019.


Example 7

Preparation of a Genomic DNA Library from B. subtilis and
Isolation of its Neutral Protease Gene

The partial amino acid sequence of a neutral protease of
B. subtilis is disclosed by P. Levy et al., 1975, "Proc. Nat. Acad.
Sci. USA" 72:4341-4345. A region of the enzyme (Asp Gln Met Ile Tyr
Gly) was selected from this published sequence in which the least
redundancy existed in the potential codons for the amino acids in the
region. 24 combinations were necessary to cover all the potential
coding sequences, as described below.

   GA(T/C)  CA(A/G)  ATG  AT(T/C/A)  TA(T/C)  GG
   Asp      Gln      Met  Ile        Tyr      Gly

Four pools, each containing six alternatives, were prepared as
described above in Example 1. The pools were labelled by
phosphorylation with [γ-32P] ATP.
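
The count of 24 combinations can be checked from the standard
genetic code. The Python sketch below is illustrative only. The
function name is hypothetical, and it assumes, consistent with the
probe ending in GG, that only the first two nucleotides of the final
glycine codon are included in the probe.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_TO_ONE = {"Asp": "D", "Gln": "Q", "Met": "M", "Ile": "I", "Tyr": "Y", "Gly": "G"}


def probe_variants(residues, last_codon_bases=3):
    """Count distinct probes covering `residues`, using only the first
    `last_codon_bases` nucleotides of the final codon."""
    total = 1
    for index, residue in enumerate(residues):
        codons = [c for c, aa in CODON_TABLE.items() if aa == THREE_TO_ONE[residue]]
        if index == len(residues) - 1:
            codons = {c[:last_codon_bases] for c in codons}
        total *= len(codons)
    return total


peptide = ["Asp", "Gln", "Met", "Ile", "Tyr", "Gly"]
print(probe_variants(peptide, last_codon_bases=2))
```

Under these assumptions the degeneracy is 2 x 2 x 1 x 3 x 2 x 1,
which agrees with four pools of six alternatives each. Including the
third nucleotide of the glycine codon would multiply the count by
four, which illustrates why the probe stops at GG.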

The labelled pool containing sequences conforming closest to a
unique sequence in a B. subtilis genome was selected by digesting
B. subtilis (1A72, Bacillus Genetic Stock Center) DNA with various
restriction enzymes, separating the digests on an electrophoresis
gel, and hybridizing each of the four probe pools to each of the
blotted digests under increasingly stringent conditions until a
single band was seen to hybridize. Increasingly stringent
conditions are those which tend to disfavor hybridization, e.g.,
increases in formamide concentration, decreases in salt
concentration and increases in temperature. At 37°C in a solution
of 5x Denhardt's, 5x SSC, 50 mM NaPO4 pH 6.8 and 20 percent
formamide, only pool 4 would hybridize to a blotted digest. These
were selected as the proper hybridization conditions to be used for
the neutral protease gene and pool 4 was used as the probe.

A lambda library of B. subtilis strain BGSC 1-A72 was prepared
in conventional fashion by partial digestion of the Bacillus genomic
DNA by Sau3A, separation of the partial digest by molecular weight
on an electrophoresis gel, elution of 15-20 kb fragments (R. Lawn
et al., 1981, "Nucleic Acids Res." 9:6103-6114), and ligation of the
fragments to BamHI digested Charon 30 phage using a Packagene kit
from Promega Biotec.

E. coli DP50supF was used as the host for the phage library,
although any known host for Charon lambda phage is satisfactory.
The E. coli host was plated with the library phage and cultured,
after which plaques were assayed for the presence of the neutral
protease gene by transfer to nitrocellulose and screening with probe
pool 4 (Benton and Davis, 1977, "Science" 196:180-182). Positive
plaques were purified through two rounds of single plaque
purification, and two plaques were chosen for further study,
designated λNPRG1 and λNPRG2. DNA was prepared from each phage by
restriction enzyme hydrolysis and separation on electrophoresis
gels. The separated fragments were blotted and hybridized to
labelled pool 4 oligonucleotides. This disclosed that λNPRG1
contained a 2400 bp HindIII hybridizing fragment, but no 4300 bp
EcoRI fragment, while λNPRG2 contained a 4300 bp EcoRI fragment, but
no 2400 bp HindIII fragment.

The 2400 bp λNPRG1 fragment was subcloned into the HindIII site
of pJH101 by the following method. λNPRG1 was digested by HindIII,
the digest fractionated by electrophoresis and the 2400 bp fragment
recovered from the gel. The fragment was ligated to alkaline
phosphatase-treated HindIII digested pJH101 and the ligation mixture
used to transform E. coli ATCC 31446 by the calcium chloride shock
method of V. Hershfield et al., 1974, "Proc. Nat. Acad. Sci.
(U.S.A.)" 71:3455-3459. Transformants were identified by selecting
colonies capable of growth on plates containing LB medium plus
12.5 µg chloramphenicol/ml.

Transformant colonies yielded several plasmids. The orientation
of the 2400 bp fragment in each plasmid was determined by
conventional restriction analysis (orientation is the sense reading
or transcriptional direction of the gene fragment in relation to the
reading direction of the expression vector into which it is
ligated.) Two plasmids with opposite orientations were obtained and
designated pNPRsubH6 and pNPRsubH1.

The 4300 bp EcoRI fragment of λNPRG2 was subcloned into pBR325
by the method described above for the 2400 bp fragment except that
λNPRG2 was digested with EcoRI and the plasmid was alkaline
phosphatase-treated, EcoRI-digested pBR325. pBR325 is described by
F. Bolivar, 1978, "Gene" 4:121-136. Two plasmids were identified in
which the 4300 bp insert was present in different orientations.
These two plasmids were designated pNPRsubRI and pNPRsubRIb.



Example 8

Characterization of B. subtilis Neutral Protease Gene

The pNPRsubH1 insert was sequentially digested with different
restriction endonucleases and blot hybridized with labelled pool 4
in order to prepare a restriction map of the insert (for general
procedures of restriction mapping see T. Maniatis et al., Id.,
p. 377). A 430 bp RsaI fragment was the smallest fragment that
hybridized to probe pool 4. The RsaI fragment was ligated into the
SmaI site of M13 mp8 (J. Messing et al., 1982, "Gene" 19:269-276 and
J. Messing in Methods in Enzymology, 1983, R. Wu et al., Eds.,
101:20-78) and the sequence determined by the chain-terminating
dideoxy method (F. Sanger et al., 1977, "Proc. Nat. Acad. Sci.
U.S.A." 74:5463-5467). Other restriction fragments from the
pNPRsubH1 insert were ligated into appropriate sites in M13 mp8 or
M13 mp9 vectors and the sequences determined. As required, dITP was
used to reduce compression artifacts (D. Mills et al., 1979, "Proc.
Nat. Acad. Sci. (U.S.A.)" 76:2232-2235). The restriction map for
the pNPRsubH1 fragment is shown in Fig. 9. The sequences of the
various fragments from restriction enzyme digests were compared and
an open reading frame spanning a codon sequence translatable into
the amino and carboxyl termini of the protease (P. Levy et al., Id.)
was determined. An open reading frame is a DNA sequence commencing
at a known point which in reading frame (every three nucleotides)
does not contain any internal termination codons. The open reading
frame extended past the amino terminus to the end of the 2400 bp
HindIII fragment. The 1300 bp BglII-HindIII fragment was prepared
from pNPRsubRIb (which contained the 4300 bp EcoRI fragment of
λNPRG2) and cloned in M13 mp8. The sequence of this fragment, which
contained the portion of the neutral protease leader region not
encoded by the 2400 bp fragment of pNPRsubH1, was determined for 400
nucleotides upstream from the HindIII site.
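
The definition of an open reading frame given above can be
illustrated with a short sketch. The function name and the toy
sequence are hypothetical and are not taken from Fig. 10; the three
termination codons are those of the standard genetic code.

```python
# Minimal sketch: test whether a DNA string is "open" in a given frame,
# i.e. contains no in-frame termination codon from `start` onward.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def is_open_reading_frame(dna, start=0):
    """Return True if no stop codon occurs in frame from `start`."""
    dna = dna.upper()
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(start, len(dna) - 2, 3):
        if dna[i:i + 3] in STOP_CODONS:
            return False
    return True


toy = "ATGGCTGCAAAATAGGCT"  # hypothetical sequence, for illustration only
print(is_open_reading_frame(toy, 0))  # frame 0 reads ATG GCT GCA AAA TAG ...
print(is_open_reading_frame(toy, 1))  # frame 1 reads TGG CTG CAA AAT AGG
```

Reading the same sequence from a different starting point changes
the result, which is why the definition specifies a known starting
point.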

The entire nucleotide sequence as determined for this neutral
protease gene, including the putative secretory leader and prepro
sequence, is shown in Fig. 10. The numbers above the line refer to
amino acid positions. The underlined nucleotides in Fig. 10 are
believed to constitute the ribosome binding (Shine-Dalgarno) site,
while the overlined nucleotides constitute a potential hairpin
structure presumed to be a terminator. The first 27-28 of the
deduced amino acids are believed to be the signal for the neutral
protease, with a cleavage point at ala-27 or ala-28. The "pro"
sequence of a proenzyme structure extends to the amino-terminal
amino acid (ala-222) of the mature, active enzyme.

A high copy plasmid carrying the entire neutral protease gene
was constructed (Fig. 11) by ligating the BglII fragment of
pNPRsubRI, which contains 1900 bp (Fig. 9), with the PvuII-HindIII
fragment of pNPRsubH1, which contains 1400 bp. pBS42 (from
Example 4) was digested with BamHI and treated with bacterial
alkaline phosphatase to prevent plasmid recircularization.
pNPRsubRI was digested with BglII, the 1900 bp fragment was isolated
from gel electrophoresis and ligated to the open BamHI sites of
pBS42. The ligated plasmid was used to transform E. coli ATCC 31446
by the calcium chloride shock method (V. Hershfield et al., Id.),
and transformed cells selected by growth on plates containing LB
medium with 12.5 µg/ml chloramphenicol. A plasmid having the BglII
fragment in the orientation shown in Fig. 11 was isolated from the
transformants and designated pNPRsubB1. pNPRsubB1 was digested
(linearized) with EcoRI, repaired to flush ends by Klenow treatment
and then digested with HindIII. The larger fragment from the
HindIII digestion (containing the sequence coding for the amino
terminal and upstream regions) was recovered.



The carboxyl terminal region of the gene was supplied by a
fragment from pNPRsubH1, obtained by digestion of pNPRsubH1 with
PvuII and HindIII and recovery of the 1400 bp fragment. The flush
end PvuII and the HindIII site of the 1400 bp fragment were ligated,
respectively, to the blunted EcoRI and the HindIII site of
pNPRsubB1, as shown in Fig. 11. This construct was used to
transform B. subtilis strain BG84 which otherwise excreted no
proteolytic activity by the assays described below. Transformants
were selected on plates containing LB medium plus 1.5 percent
Carnation powdered nonfat milk and 5 µg/ml chloramphenicol. Plasmids
from colonies that cleared a large halo were analyzed. Plasmid
pNPR10, incorporating the structural gene and flanking regions of
the neutral protease gene, was determined by restriction analysis to
have the structure shown in Fig. 11.

B. subtilis strain BG84 was produced by N-methyl-N'-nitro-N-
nitrosoguanidine (NTG) mutagenesis of B. subtilis I168 according to
the general technique of Adelberg et al., 1965, "Biochem. Biophys.
Res. Commun." 18:788-795. Mutagenized strain I168 was plated on
skim milk plates (without antibiotic). Colonies producing a smaller
halo were picked for further analysis. Each colony was
characterized for protease production on skim milk plates and
amylase production on starch plates. One such isolate, which was
partially protease deficient, amylase positive and capable of
sporulation, was designated BG77. The protease deficiency mutation
was designated prt-77. The prt-77 allele was moved to a spo0A
background by congression as described below to produce strain BG84,
a sporulation deficient strain.



Table A

Strain   Relevant Genotype                           Origin

I168     trpC2
JH703    trpC2, pheA12, spo0AΔ677                    Trousdale et al.(a)
BG16     purB6, metB5, leuA8, lys-21, hisA, thr-5,   Pb 1665
         sacA321
BG77     trpC2, prt-77                               NTG x I168
BG81     metB5, prt-77                               BG16 DNA x BG77
BG84     spo0AΔ677, prt-77                           JH703 DNA x BG81

(a) "Mol. Gen. Genetics" 173:61 (1979)



BG84 was completely devoid of protease activity on skim milk
plates and does not produce detectable levels of either subtilisin
or neutral protease when assayed by measuring the change in
absorbance at 412 nm per minute upon incubation with 0.2 µg/ml
succinyl(-L-ala-L-ala-L-pro-L-phe) p-nitroanilide (Vega) in 0.1 M
sodium phosphate, pH 8, at 25°C. BG84 was deposited in the ATCC as
deposit number 39382 on July 21, 1983. Samples for subtilisin assay
were taken from late logarithmic growth phase supernatants of
cultures grown in modified Schaeffer's medium (T. Leighton et al.,
1971, "J. Biol. Chem." 246:3189-3195).

Example 9

Expression of the Neutral Protease Gene

BG84 transformed with pNPR10 was inoculated into minimal media
supplemented with 0.1 percent casein hydrolysate and 10 µg
chloramphenicol and cultured for 16 hours. 0.1 ml of culture
supernatant was removed and added to a suspension of 1.4 mg/ml
Azocoll proteolytic substrate (Sigma) in 10 mM Tris-HCl, 100 mM NaCl
pH 6.8 and incubated with agitation. Undigested substrate was
removed by centrifugation and the optical density read at 505 nm.
Background values of an Azocoll substrate suspension were
subtracted. The amount of protease excreted by a standard
protease-expressing strain, BG16, was used to establish an arbitrary
level of 100. The results with BG16, and with BG84 transformed with
control and neutral protease gene-containing plasmids are shown in
Table B in Example 12 below. Transformation of the excreted
protease-devoid B. subtilis strain BG84 results in excretion of
protease activity at considerably greater levels than in BG16, the
wild-type strain.



Example 10

Manufacture of an Inactivating Mutation of the Neutral Protease Gene

The two RsaI bounded regions in the 2400 bp insert of pNPRsubH1,
totalling 527 bp, can be deleted in order to produce an incomplete
structural gene. The translational products of this gene are
enzymatically inactive. A plasmid having this deletion was
constructed as follows. pJH101 was cleaved by digestion with
HindIII and treated with bacterial alkaline phosphatase. The
fragments of the neutral protease gene to be incorporated into
linearized pJH101 were obtained by digesting pNPRsubH1 with HindIII
and RsaI, and recovering the 1200 bp HindIII-RsaI and 680 bp
RsaI-HindIII fragments by gel electrophoresis. These fragments were
ligated into linearized pJH101 and used to transform E. coli
ATCC 31446. Transformants were selected on plates containing LB
medium and 20 µg ampicillin/ml. Plasmids were recovered from the
transformants and assayed by restriction enzyme analysis to identify
a plasmid having the two fragments in the same orientation as in the
pNPRsubH1 starting plasmid. The plasmid lacking the internal RsaI
fragments was designated pNPRsubH1Δ.
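
As a quick consistency check on the fragment sizes quoted in this
example, the sketch below compares the insert length remaining after
the deletion with the sum of the two recovered fragments. All numbers
come from the text above; the variable names are illustrative, and
the small difference is presumably within the accuracy of sizes
estimated from gels.

```python
# Fragment bookkeeping for the RsaI deletion, using the sizes quoted above.
insert_bp = 2400          # HindIII insert of pNPRsubH1
deleted_bp = 527          # the two internal RsaI-bounded regions
retained_bp = [1200, 680] # HindIII-RsaI and RsaI-HindIII fragments

expected_remaining = insert_bp - deleted_bp  # what the deletion should leave
observed_remaining = sum(retained_bp)        # what the recovered fragments add up to
discrepancy = observed_remaining - expected_remaining

print(expected_remaining, observed_remaining, discrepancy)
```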

10 

Example 11

Replacement of the Neutral Protease Gene with a Deletion Mutant

Plasmid pNPRsubH1Δ was transformed into B. subtilis strain
BG2019 (the subtilisin deleted mutant from Example 6) and
chromosomal integrants were selected on skim milk plates. Two types
of Cm^r transformants were noted, those with parental levels of
proteolysis surrounding the colony, and those with almost no zone of
proteolysis. Those lacking a zone of proteolysis were picked,
restreaked to purify individual colonies, and their protease
deficient character on skim milk plates confirmed. One of the
Cm^r, proteolysis deficient colonies was chosen for further studies
(designated BG2034). Spontaneous Cm^s revertants of BG2034 were
isolated by overnight growth in LB media containing no Cm, plating
for individual colonies, and replica plating on media with and
without Cm. Three Cm^s revertants were isolated, two of which were
protease proficient, one of which was protease deficient (designated
BG2036). Hybridization analysis of BG2036 confirmed that the
plasmid had been lost from this strain, probably by recombination,
leaving only the deletion fragments of subtilisin and neutral
protease.

Example 12

Phenotype of Strains Lacking Functional Subtilisin and Neutral
Protease

The growth, sporulation and expression of proteases was examined
in strains lacking a functional gene for either the neutral or
alkaline protease or both. The expression of proteases was examined
by a zone of clearing surrounding a colony on a skim milk plate and
by measurement of the protease levels in liquid culture supernatants
(Table B). A strain (BG2035) carrying the subtilisin gene deletion
showed a 30 percent reduction in the level of protease activity and
a normal halo on milk plates. Strain BG2043, carrying the deleted
neutral protease gene and active subtilisin gene, and constructed by
transforming BG16 (Ex. 8) with DNA from BG2036 (Example 11), showed
an 80 percent reduction in protease activity and only a small halo
on the milk plate. Strain BG2054, considered equivalent to BG2036
(Example 11) in that it carried the foregoing deletions in both
genes, showed no detectable protease activity in this assay and no
detectable halo on milk plates. The deletion of either or both of
the protease genes had no apparent effect on either growth or
sporulation. Strains carrying these deletions had normal growth
rates on both minimal glucose and LB media. The strains sporulated
at frequencies comparable to the parent strain BG16. Examination of
morphology of these strains showed no apparent differences from
strains without such deletions.

Table B

Effect of protease deletions on protease expression and sporulation.

Strain         Genotype(a)          Protease      Percent
                                    activity(b)   Sporulation

BG16           Wild type            100           40
BG2035         aprΔ684              70            20
BG2043         nprEΔ522             20            20
BG2054         aprΔ684, nprEΔ522    ND            45
BG84(pBS42)    spo0AΔ677, prt-77    ND
BG84(pNPR10)   spo0AΔ677, prt-77    3000

(a) Only the loci relevant to the protease phenotype are shown.

(b) Protease activity is expressed in arbitrary units; BG16 was
assigned a level of 100. ND indicates the level of protease was not
detectable in the assay used.
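
The percentage reductions quoted in this example follow directly
from the arbitrary activity units of Table B, in which BG16 is set
to 100. The sketch below makes the arithmetic explicit; the
dictionary and function names are illustrative only, and strains
with undetectable activity are omitted.

```python
# Protease activity in arbitrary units, as given in Table B (BG16 = 100).
ACTIVITY = {
    "BG16": 100,
    "BG2035": 70,
    "BG2043": 20,
    "BG84(pNPR10)": 3000,
}


def percent_reduction(strain, reference="BG16"):
    """Reduction in activity relative to the reference strain, in percent."""
    ref = ACTIVITY[reference]
    return 100.0 * (ref - ACTIVITY[strain]) / ref


def fold_change(strain, reference="BG16"):
    """Activity expressed as a multiple of the reference strain."""
    return ACTIVITY[strain] / ACTIVITY[reference]


print(percent_reduction("BG2035"))   # subtilisin deletion
print(percent_reduction("BG2043"))   # neutral protease deletion
print(fold_change("BG84(pNPR10)"))   # plasmid-borne neutral protease gene
```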

Example 13

Site-specific Saturation Mutagenesis of the B. amyloliquefaciens
Subtilisin Gene at Position 222; Preparation of the Gene for
Cassette Insertion

pS4-5, a derivative of pS4 made according to Wells et al.,
"Nucleic Acids Res.", 1983, 11:7911-7924 was digested with EcoRI and
BamHI, and the 1.5 kb EcoRI-BamHI fragment recovered. This fragment
was ligated into replicative form M-13 mp9 which had been digested
with EcoRI and BamHI (Sanger et al., 1980, "J. Mol. Biol." 143,
161-178; Messing et al., 1981, "Nucleic Acids Research" 9, 304-321;
Messing, J. and Vieira, J. (1982) Gene 19, 269-276). The M-13 mp9
phage ligations, designated M-13 mp9 SUBT, were used to transform
E. coli strain JM101 and single stranded phage DNA was prepared from
a two mL overnight culture. An oligonucleotide primer was
synthesized having the sequence

5'-GTACAACGGTACCTCACGCACGCTGCAGGAGCGGCTGC-3'. This primer conforms
to the sequence of the subtilisin gene fragment encoding amino acids
216-232 except that the 10 bp of codons for amino acids 222-225 were
deleted, and the codons for amino acids 220, 227 and 228 were
mutated to introduce a KpnI site 5' to the met+222 codon and a PstI
site 3' to the met+222 codon. See Fig. 12. Substituted nucleotides
are denoted by asterisks, the underlined codons in line 2 represent
the new restriction sites and the scored sequence in line 4
represents the inserted oligonucleotides. The primer (about 15 µM)
was labelled with [32P] by incubation with [γ-32P]-ATP (10 µL in
20 µL reaction) (Amersham 5000 Ci/mmol, 10218) and T4
polynucleotide kinase (10 units) followed by non-radioactive ATP
(100 µM) to allow complete phosphorylation of the mutagenesis
primer. The kinase was inactivated by heating the phosphorylation
mixture at 68°C for 15 min.
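
The two restriction sites carried by the mutagenesis primer can be
located with a simple string search. This is a minimal sketch: the
primer sequence is the one quoted above, the recognition sequences
GGTACC (KpnI) and CTGCAG (PstI) are the standard ones, and the
function name is illustrative.

```python
# Locate the KpnI and PstI recognition sequences in the mutagenesis primer.
PRIMER = "GTACAACGGTACCTCACGCACGCTGCAGGAGCGGCTGC"
SITES = {"KpnI": "GGTACC", "PstI": "CTGCAG"}


def find_sites(seq, sites):
    """Return the 0-based index of the first occurrence of each site (-1 if absent)."""
    return {name: seq.find(motif) for name, motif in sites.items()}


positions = find_sites(PRIMER, SITES)
# Number of nucleotides lying between the two recognition sequences.
gap = positions["PstI"] - (positions["KpnI"] + len(SITES["KpnI"]))

print(len(PRIMER), positions, gap)
```

The short distance between the two recognition sequences is
relevant to the incomplete double digestion discussed at the end of
this example.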

The primer was hybridized to M-13 mp9 SUBT as modified from
Norris et al., 1983, "Nucleic Acids Res." 11, 5103-5112 by combining
5 µL of the labelled mutagenesis primer (~3 µM), ~1 µg M-13 mp9 SUBT
template, 1 µL of 1 µM M-13 sequencing primer (17-mer), and 2.5 µL
of buffer (0.3 M Tris pH 8, 40 mM MgCl2, 12 mM EDTA, 10 mM DTT,
0.5 mg/ml BSA). The mixture was heated to 68°C for 10 minutes and
cooled 10 minutes at room temperature. To the annealing mixture was
added 3.6 µL of 0.25 mM dGTP, dCTP, dATP, and dTTP, 1.25 µL of 10 mM
ATP, 1 µL ligase (4 units) and 1 µL Klenow (5 units). The primer
extension and ligation reaction (total volume 25 µL) proceeded
2 hours at 14°C. The Klenow and ligase were inactivated by heating
to 68°C for 20 min. The heated reaction mixture was digested with
BamHI and EcoRI and an aliquot of the digest was applied to a 6
percent polyacrylamide gel and radioactive fragments were visualized
by autoradiography. This showed the [32P] mutagenesis primer had
indeed been incorporated into the EcoRI-BamHI fragment containing
the now mutated subtilisin gene.
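
The final concentrations in the 25 µL primer extension reaction
follow from the stock concentrations and volumes given above by
simple dilution. The helper below is an illustrative sketch, not part
of the original protocol; it assumes each dNTP is at 0.25 mM in the
3.6 µL added.

```python
# Final concentration after dilution: C_final = C_stock * V_stock / V_total.
def final_conc(stock_conc, stock_vol_ul, total_vol_ul):
    """Concentration after dilution, in the same units as stock_conc."""
    return stock_conc * stock_vol_ul / total_vol_ul


TOTAL_UL = 25.0  # total reaction volume stated in the text

dntp_mM = final_conc(0.25, 3.6, TOTAL_UL)      # each dNTP, mM
atp_mM = final_conc(10.0, 1.25, TOTAL_UL)      # ATP, mM
seq_primer_uM = final_conc(1.0, 1.0, TOTAL_UL) # M-13 sequencing primer, µM

print(dntp_mM, atp_mM, seq_primer_uM)
```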

The remainder of the digested reaction mixture was diluted to
200 µL with 10 mM Tris, pH 8, containing 1 mM EDTA, extracted once
with a 1:1 (v:v) phenol/chloroform mixture, then once with
chloroform, and the aqueous phase recovered. 15 µL of 5M ammonium
acetate (pH 8) was added along with two volumes of ethanol to
precipitate the DNA from the aqueous phase. The DNA was pelleted by
centrifugation for five minutes in a microfuge and the supernatant
was discarded. 300 µL of 70 percent ethanol was added to wash the
DNA pellet, the wash was discarded and the pellet lyophilized.

pBS42 from Example 4 above was digested with BamHI and EcoRI and
purified on an acrylamide gel to recover the vector. 0.5 µg of the
digested vector, 50 µM ATP and 6 units ligase were dissolved in 20 µL
of ligation buffer. The ligation went overnight at 14°C. The DNA
was transformed into E. coli 294 rec+ and the transformants grown
in 4 ml of LB medium containing 12.5 µg/ml chloramphenicol. Plasmid
DNA was prepared from this culture and digested with KpnI, EcoRI and
BamHI. Analysis of the restriction fragments showed 30-50 percent
of the molecules contained the expected KpnI site programmed by the
mutagenesis primer. It was hypothesized that the plasmid population
not including the KpnI site resulted from M-13 replication before
bacterial repair of the mutagenesis site, thus producing a
heterogeneous population of KpnI+ and KpnI- plasmids in some of
the transformants. In order to obtain a pure culture of the KpnI+
plasmid, the DNA was transformed a second time into E. coli to clone
plasmids containing the new KpnI site. DNA was prepared from 16
such transformants and six were found to contain the expected KpnI
site.

Preparative amounts of DNA were made from one of these six
transformants (designated pΔ222) and restriction analysis confirmed
the presence and location of the expected KpnI and PstI sites. 40
µg of pΔ222 were digested in 300 µL of KpnI buffer plus 30 µL KpnI
(300 units) for 1.5 h at 37°C. The DNA was precipitated with
ethanol, washed with 70 percent ethanol, and lyophilized. The DNA
pellet was taken up in 200 µL HindIII buffer and digested with 20 µL
(500 units) PstI for 1.5 h at 37°C. The aqueous phase was extracted
with phenol/CHCl3 and the DNA precipitated with ethanol. The DNA
was dissolved in water and purified by polyacrylamide gel
electrophoresis. Following electroelution of the vector band (120 v
for 2 h at 0°C in 0.1 times TBE (Maniatis et al., Id.)) the DNA was
purified by phenol/CHCl3 extraction, ethanol precipitation and
ethanol washing.

Although pΔ222 could be digested to completion (>98 percent) by
either KpnI or PstI separately, exhaustive double digestion was
incomplete (<<50 percent). This may have resulted from the fact
that these sites were so close (10 bp) that digestion by KpnI
allowed "breathing" of the DNA in the vicinity of the PstI site,
i.e., strand separation or fraying. Since PstI will only cleave
double stranded DNA, strand separation could inhibit subsequent PstI
digestion.



Example 14

Ligation of Oligonucleotide Cassettes into the Subtilisin Gene

10 µM of four complementary oligonucleotide pools (A-D, Table
C), which were not 5' phosphorylated, were annealed in 20 µL
ligase buffer by heating for five minutes at 68°C and then cooling
for fifteen minutes at room temperature. 1 µM of each annealed
oligonucleotide pool, ~0.2 µg KpnI and PstI-digested pΔ222 obtained
in Example 13, 0.5 mM ATP, ligase buffer and 6 units T4 DNA ligase
in 20 µL total volume was reacted overnight at 14°C to ligate the
pooled cassettes in the vector. A large excess of cassettes (~300x
over the pΔ222 ends) was used in the ligation to help prevent
intramolecular KpnI-KpnI ligation. The reaction was diluted by
adding 25 µL of 10 mM Tris pH 8 containing 1 mM EDTA. The mixture
was reannealed to avoid possible cassette concatemer formation by
heating to 68°C for five minutes and cooling for 15 minutes at room
temperature. The ligation mixtures from each pool were transformed
separately into E. coli 294 rec+ cells. A small aliquot from each
transformation mixture was plated to determine the number of
independent transformants. The large number of transformants
indicated a high probability of multiple mutagenesis. The rest of
the transformants (~200-400 transformants) were cultured in 4 ml of
LB medium plus 12.5 µg chloramphenicol/ml. DNA was prepared from
each transformation pool (A-D). This DNA was digested with KpnI.
~0.1 µg was used to retransform E. coli rec+ and the mixture was
plated to isolate individual colonies from each pool. Ligation of
the cassettes into the gene and bacterial repair upon transformation
destroyed the KpnI and PstI sites. Thus, only pΔ222 was cut when
the transformant DNA was digested with KpnI. The cut plasmid would
not transform E. coli. Individual transformants were grown in
culture and DNA was prepared from 24 to 26 transformants per pool
for direct plasmid sequencing. A synthetic oligonucleotide primer
having the sequence 5'-GAGCTTGATGTCATGGC-3' was used to prime the
dideoxy sequencing reaction. The mutants which were obtained are
described in Table C below.
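
The order of magnitude of the cassette excess in the ligation can
be estimated from the quantities given above. The sketch below is
only a rough check under stated assumptions: the vector length of
6000 bp is a hypothetical value (the size of pΔ222 is not given in
the text), 660 g/mol per base pair is a commonly used average, and
the ratio is computed per vector molecule rather than per end.

```python
# Rough estimate of the molar excess of cassette over vector in the ligation.
AVG_BP_MASS = 660.0  # g/mol per base pair (common approximation; an assumption)


def pmol_dsdna(mass_ug, length_bp):
    """Picomoles of double-stranded DNA for a given mass and length."""
    grams = mass_ug * 1e-6
    moles = grams / (length_bp * AVG_BP_MASS)
    return moles * 1e12


# 1 µM cassette in a 20 µL reaction: 1 µmol/L x 20 µL = 20 pmol.
cassette_pmol = 1.0 * 20.0

# ~0.2 µg of digested vector; 6000 bp is an ASSUMED vector length.
vector_pmol = pmol_dsdna(0.2, 6000)

excess = cassette_pmol / vector_pmol
print(round(vector_pmol, 4), round(excess))
```

Under these assumptions the excess comes out at a few hundred-fold,
the same order as the ~300x stated above.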

Two codon+222 mutants (i.e., gln and ile) were not found after
the screening described. To obtain these a single 25mer
oligonucleotide was synthesized for each mutant corresponding to the
top oligonucleotide strand in Figure 12. Each was phosphorylated
and annealed to the bottom strand of its respective
nonphosphorylated oligonucleotide pool (i.e., pool A for gln and
pool D for ile). This was ligated into KpnI and PstI digested pΔ222
and processed as described for the original oligonucleotide pools.
The frequency of appearance for single mutants obtained this way was
2/8 and 0/7 for gln and ile, respectively. To avoid this apparent
bias the top strand was phosphorylated and annealed to its
unphosphorylated complementary pool. The heterophosphorylated
cassette was ligated into cut pΔ222 and processed as before. The
frequency of appearance of gln and ile mutants was now 7/7 and 7/7,
respectively.

The data in Table C demonstrate a bias in the frequency of
mutants obtained from the pools. This probably resulted from
unequal representation of oligonucleotides in the pool. This may
have been caused by unequal coupling of the particular trimers over
the mutagenesis codon in the pool. Such a bias problem could be
remedied by appropriate adjustment of trimer levels during synthesis
to reflect equal reaction. In any case, mutants which were not
isolated in the primary screen were obtained by synthesizing a
single strand oligonucleotide representing the desired mutation,
phosphorylating both ends, annealing to the pool of
non-phosphorylated complementary strands and ligating into the
cassette site. A biased heteroduplex repair observed for the
completely unphosphorylated cassette may result from the fact that
position 222 is closer to the 5' end of the upper strand than it is
to the 5' end of the lower strand (see Figure 12). Because a gap
exists at the unphosphorylated 5' ends and the mismatch bubble in
the double stranded DNA is at position 222, excision repair of the
top strand gap would more readily maintain a circularly hybridized
duplex capable of replication. Consistent with this hypothesis is
the fact that the top strand could be completely retained by
selective 5' phosphorylation. In this case only the bottom strand
contained a 5' gap which could promote excision repair. This method
is useful in directing biased incorporation of synthetic
oligonucleotide strands when employing mutagenic oligonucleotide
cassettes.
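
One way to put a number on the bias noted in Table C is a chi-square
statistic against equal representation of the five intended codons
in each pool. This is an illustrative sketch and not an analysis
performed in the original work; the counts are the Table C
frequencies with the "unexpected mutants" excluded, and the function
name is hypothetical.

```python
# Chi-square statistic for each pool against a uniform expectation.
TABLE_C = {
    "A": {"asp": 2, "met": 3, "cys": 13, "arg": 2, "gln": 0},
    "B": {"leu": 1, "pro": 3, "phe": 6, "tyr": 5, "his": 1},
    "C": {"glu": 3, "ala": 3, "thr": 1, "lys": 1, "asn": 1},
    "D": {"gly": 1, "trp": 8, "ile": 0, "ser": 1, "val": 4},
}


def chi_square_uniform(counts):
    """Sum of (observed - expected)^2 / expected, expected = equal share."""
    total = sum(counts)
    expected = total / len(counts)
    return sum((c - expected) ** 2 / expected for c in counts)


for pool, counts in TABLE_C.items():
    stat = chi_square_uniform(list(counts.values()))
    print(pool, round(stat, 2))
```

With four degrees of freedom per pool, larger values indicate a
stronger departure from equal representation; pools A and D, which
each lacked one intended mutant, give the largest statistics.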



Example 15

Site-Specific Mutagenesis of the Subtilisin Gene at Position 166

The procedure of Examples 13-14 was followed in substantial
detail, except that the mutagenesis primer differed (the 37 mer
shown in Fig. 13 was used), the two restriction enzymes were SacI
and XmaIII rather than PstI and KpnI and the resulting constructions
differed, as shown in Fig. 13.

Bacillus strains excreting mutant subtilisins at position 166
were obtained as described below in Example 16. The mutant
subtilisins exhibiting substitutions of ala, asp, gln, phe, his,
lys, asn, arg, and val for the wild-type residue were recovered.

Example 16

Preparation of Mutant Subtilisin Enzymes

B. subtilis strain BG2036 obtained by the method of Example 11
was transformed by the plasmids of Examples 14, 15 or 20 and by
pS4-5 as a control. Transformants were plated or cultured in shaker
flasks for 16 to 48 h at 37°C in LB media plus 12.5 µg/ml
chloramphenicol. Mutant enzymatically active subtilisin was
recovered by dialyzing cell broth against 0.01M sodium phosphate
buffer, pH 6.2. The dialyzed broth was then titrated to pH 6.2 with
1N HCl and loaded on a 2.5 x 2 cm column of CM cellulose (CM-52
Whatman). After washing with 0.01M sodium phosphate, pH 6.2, the
subtilisins (except mutants at position +222) were eluted with the
same buffer made 0.08M in NaCl. The mutant subtilisins at position
+222 were each eluted with 0.1M sodium phosphate, pH 7.0. The
purified mutant and wild type enzymes were then used in studies of
oxidation stability, Km, Kcat, Kcat/Km ratio, pH optimum, and
changes in substrate specificity.



Table C

Oligonucleotide Pool Organization and Frequency of Mutants Obtained

Pool   Amino Acids              Codon-222(a)   Frequency(b)

A      asp                      GAT            2/25
       met                      ATG            3/25
       cys                      TGT            13/25
       arg                      AGA            2/25
       gln                                     0/25
       unexpected mutants(c)                   5/25

B      leu                      CTT            1/25
       pro                      CCT            3/25
       phe                      TTC            6/25
       tyr                      TAC            5/25
       his                      CAC            1/25
       unexpected mutants                      9/25

C      glu                      GAA            3/17
       ala                      GCT            3/17
       thr                      ACA            1/17
       lys                      AAA            1/17
       asn                      AAC            1/17
       unexpected mutants                      8/17

D      gly                      GGC            1/23
       trp                      TGG            8/23
       ile                      ATC            0/23
       ser                      AGC            1/23
       val                      GTT            4/23
       unexpected mutants                      9/23

(a) Codons were chosen based on frequent use in the cloned
subtilisin gene sequence (Wells et al., 1983, Id.).

(b) Frequency was determined from single track analysis by direct
plasmid sequencing.

(c) Unexpected mutants generally comprised double mutants with
changes in codons next to 222 or at the points of ligation.
These were believed to result from impurities in the
oligonucleotide pools and/or erroneous repair of the gapped
ends.

Example 17

Mutant Subtilisin Exhibiting Improved Oxidation Stability

Subtilisins having cysteine and alanine substituted at the 222
position for wild-type methionine (Example 16) were assayed for
resistance to oxidation by incubating with various concentrations of
sodium hypochlorite (Clorox Bleach).

To a total volume of 400 µL of 0.1M, pH 7, NaPO4 buffer
containing the indicated bleach concentrations (Fig. 14) sufficient
enzyme was added to give a final concentration of 0.016 mg/ml of
enzyme. The solutions were incubated at 25°C for 10 min. and
assayed for enzyme activity as follows: 120 µL of either ala+222 or
wild type, or 100 µL of the cys+222 incubation mixture was combined
with 890 µL 0.1M tris buffer at pH 8.6 and 10 µL of a sAAPFpN
(Example 18) substrate solution (20 mg/ml in DMSO). The rate of
increase in absorbance at 410 nm due to release of p-nitroaniline
(Del Mar, E.G., et al., 1979 "Anal. Biochem." 99, 316-320) was
monitored. The results are shown in Fig. 14. The alanine
substitution produced considerably more stable enzyme than either
the wild-type enzyme or a mutant in which a labile cysteine residue
was substituted for methionine. Surprisingly, the alanine
substitution did not substantially interfere with enzyme activity
against the assay substrate, yet conferred relative oxidation
stability on the enzyme. The serine+222 mutant also exhibited
improved oxidation stability.



Example 18

Mutant Subtilisins Exhibiting Modified Kinetics and Substrate
Specificity

Various mutants for glycine+166 were screened for modified
Kcat, Km and Kcat/Km ratios. Kinetic parameters were obtained by
analysis of the progress curves of the reactions. The rate of
reaction was measured as a function of substrate concentration. Data
was analyzed by fitting to the Michaelis-Menten equation using the
non-linear regression algorithm of Marquardt (Marquardt, D. W. 1963,
"J. Soc. Ind. Appl. Math." 11, 431-41). All reactions were
conducted at 25°C in 0.1M tris buffer, pH 8.6, containing
benzoyl-L-Valyl-Glycyl-L-Arginyl-p-nitroanilide (BVGRpN; Vega
Biochemicals) at initial concentrations of 0.0025 M to 0.00026 M
(depending on the value of Km for the enzyme of interest;
concentrations were adjusted in each measurement so as to exceed Km)
or succinyl-L-Alanyl-L-Alanyl-L-Prolyl-L-Phenylalanyl-p-nitro-
anilide (sAAPFpN; Vega Biochemicals) at initial concentrations of
0.0010 M to 0.00028 M (varying as described for BVGRpN).
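
The kind of fit described above can be sketched as a damped
least-squares (Marquardt-style) iteration on the Michaelis-Menten
equation v = Vmax*S/(Km + S). The code below is a minimal
illustration and not the program used in the original work: the
function names are hypothetical, the rate data are synthetic and
noise-free, and the generating parameters (37 and 1.4e-4 M) are
chosen only for illustration.

```python
# Minimal Marquardt-style fit of the Michaelis-Menten equation (two parameters).
def mm_rate(s, vmax, km):
    """Michaelis-Menten rate v = Vmax*S/(Km + S)."""
    return vmax * s / (km + s)


def sum_sq(s_vals, v_vals, vmax, km):
    """Sum of squared residuals between observed and modelled rates."""
    return sum((v - mm_rate(s, vmax, km)) ** 2 for s, v in zip(s_vals, v_vals))


def fit_mm(s_vals, v_vals, vmax, km, max_iter=500):
    """Refine (vmax, km) from initial guesses by damped Gauss-Newton steps."""
    lam = 1e-3  # damping factor, adapted after each trial step
    best = sum_sq(s_vals, v_vals, vmax, km)
    for _ in range(max_iter):
        a11 = a12 = a22 = g1 = g2 = 0.0
        for s, v in zip(s_vals, v_vals):
            d_vmax = s / (km + s)                 # dv/dVmax
            d_km = -vmax * s / (km + s) ** 2      # dv/dKm
            resid = v - mm_rate(s, vmax, km)
            a11 += d_vmax * d_vmax
            a12 += d_vmax * d_km
            a22 += d_km * d_km
            g1 += d_vmax * resid
            g2 += d_km * resid
        # Damped normal equations: (J^T J + lam*diag(J^T J)) * step = J^T r
        m11, m22 = a11 * (1 + lam), a22 * (1 + lam)
        det = m11 * m22 - a12 * a12
        if det == 0 or lam > 1e12:
            break  # singular system, or no further improvement possible
        d1 = (g1 * m22 - a12 * g2) / det
        d2 = (m11 * g2 - a12 * g1) / det
        new_vmax, new_km = vmax + d1, km + d2
        if new_km > 0 and sum_sq(s_vals, v_vals, new_vmax, new_km) < best:
            vmax, km = new_vmax, new_km           # accept step, relax damping
            best = sum_sq(s_vals, v_vals, vmax, km)
            lam /= 10
        else:
            lam *= 10                             # reject step, damp harder
    return vmax, km


# Synthetic, noise-free data generated from illustrative parameters.
S = [2.8e-4, 4.0e-4, 6.0e-4, 8.0e-4, 1.0e-3]
V = [mm_rate(s, 37.0, 1.4e-4) for s in S]
vmax_fit, km_fit = fit_mm(S, V, vmax=30.0, km=3.0e-4)
print(vmax_fit, km_fit)
```

Dividing the fitted Vmax by the molar enzyme concentration gives
Kcat, from which the Kcat/Km ratio is obtained.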

The results obtained in these experiments were as follows: 



Table D

Substrate   Enzyme                Kcat (s^-1)   Km (M)         Kcat/Km

sAAPFpN     gly-166 (wild type)   37            1.4 x 10^-4    3 x 10^5
            ala+166               19            2.7 x 10^-5    7 x 10^5
            asp+166               3             5.8 x 10^-4    5 x 10^3
            glu+166               11            3.4 x 10^-4    3 x 10^4
            phe+166               3             1.4 x 10^-5    2 x 10^5
            his+166               15            1.1 x 10^-4    1 x 10^5
            lys+166               15            3.4 x 10^-5    4 x 10^5
            asn+166               26            1.4 x 10^-4    2 x 10^5
            arg+166               19            6.2 x 10^-5    3 x 10^5
            val+166               1             1.4 x 10^-4    1 x 10^4

BVGRpN      Wild type             2             1.1 x 10^-3    2 x 10^3
            asp+166               2             4.1 x 10^-5    5 x 10^4
            glu+166               2             2.7 x 10^-5    7 x 10^4
            asn+166               1             1.2 x 10^-4    8 x 10^3

The Kcat/Km ratio for each of the mutants varied from that of
the wild-type enzyme. As a measure of catalytic efficiency, these
ratios demonstrate that enzymes having much higher activity against
a given substrate can be readily designed and selected by screening
in accordance with the invention herein. For example, A166 exhibits
over 2 times the activity of the wild type on sAAPFpN.

This data also demonstrates changes in substrate specificity
upon mutation of the wild type enzyme. For example, the Kcat/Km
ratio for the D166 and E166 mutants is higher than the wild type
enzyme with the BVGRpN substrate, but qualitatively opposite results
were obtained upon incubation with sAAPFpN. Accordingly, the D166
and E166 mutants were relatively more specific for BVGRpN than for
sAAPFpN.
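
The comparisons drawn above can be reproduced from the Kcat and Km
values of Table D. The following sketch uses only the rows needed
for those comparisons; the dictionary layout and function names are
illustrative assumptions.

```python
# Catalytic efficiency (Kcat/Km) and relative substrate specificity from Table D.
TABLE_D = {
    ("sAAPFpN", "wild type"): (37, 1.4e-4),
    ("sAAPFpN", "ala+166"): (19, 2.7e-5),
    ("sAAPFpN", "asp+166"): (3, 5.8e-4),
    ("sAAPFpN", "glu+166"): (11, 3.4e-4),
    ("BVGRpN", "wild type"): (2, 1.1e-3),
    ("BVGRpN", "asp+166"): (2, 4.1e-5),
    ("BVGRpN", "glu+166"): (2, 2.7e-5),
}


def efficiency(substrate, enzyme):
    """Kcat/Km for one enzyme on one substrate."""
    kcat, km = TABLE_D[(substrate, enzyme)]
    return kcat / km


def relative_specificity(enzyme):
    """Efficiency on BVGRpN divided by efficiency on sAAPFpN."""
    return efficiency("BVGRpN", enzyme) / efficiency("sAAPFpN", enzyme)


# A166 versus wild type on sAAPFpN.
print(efficiency("sAAPFpN", "ala+166") / efficiency("sAAPFpN", "wild type"))

# Preference for BVGRpN over sAAPFpN, wild type versus the acidic mutants.
for name in ("wild type", "asp+166", "glu+166"):
    print(name, relative_specificity(name))
```

A relative specificity above 1 means the enzyme is more efficient on
BVGRpN than on sAAPFpN; the wild type falls far below 1, while the
asp+166 and glu+166 mutants fall above it.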

Example 19 

Mutant Subtilisin Exhibiting Modified pH-Activity Profile 

The pH profile of the Cys+222 mutant obtained in Example 16 was
compared to that of the wild type enzyme. 10 µL of 60 mg/ml sAAPFpN
in DMSO, 10 µL of Cys+222 (0.18 mg/ml) or wild type (0.5 mg/ml) and
980 µL of buffer (for measurements at pH 6.6, 7.0 and 7.6, 0.1M
NaPO4 buffer; at pH 8.2, 8.6 and 9.2, 0.1M tris buffer; and at pH
9.6 and 10.0, 0.1M glycine buffer) were combined, after which the
initial rate of change in absorbance at 410 nm per minute was
measured at each pH and the data plotted in Fig. 15. The Cys+222
mutant exhibits a sharper pH optimum than the wild type enzyme.

Example 20

Site-Specific Mutagenesis of the Subtilisin Gene at Position 169

The procedure of Examples 13-14 was followed in substantial
detail, except that the mutagenesis primer differed (the primer
shown in Fig. 16 was used), the two restriction enzymes were KpnI
and EcoRV rather than PstI and KpnI, and the resulting constructions
differed, as shown in Fig. 16.
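
The method relies on restriction sites that are unique to the DNA moiety and that are introduced without altering the encoded amino acids. A minimal sketch of how such a candidate substitution might be checked is given here. The short sequences are hypothetical and are not taken from the subtilisin gene; the helper names are likewise assumptions. The recognition sequences used for KpnI, EcoRV and PstI are palindromic, so searching one strand is sufficient.

```python
from itertools import product

# Standard genetic code, built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Recognition sequences of the enzymes named in Example 20.
SITES = {"KpnI": "GGTACC", "EcoRV": "GATATC", "PstI": "CTGCAG"}


def translate(dna):
    """Translate an in-frame coding sequence into one-letter amino acids."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def count_site(dna, site):
    """Count occurrences of a recognition sequence, including overlaps."""
    n = len(site)
    return sum(1 for i in range(len(dna) - n + 1) if dna[i:i + n] == site)


def creates_unique_silent_site(original, mutated, enzyme):
    """True if the substitution keeps the protein unchanged and yields
    exactly one recognition site for the given enzyme."""
    site = SITES[enzyme]
    return (
        len(original) == len(mutated)
        and translate(original) == translate(mutated)
        and count_site(mutated, site) == 1
    )


# Hypothetical coding fragment: Met Gly Thr Asp Ile stop
original = "ATGGGAACCGATATTTAA"
# GGA -> GGT (Gly) creates GGTACC; ATT -> ATC (Ile) creates GATATC
mutated = "ATGGGTACCGATATCTAA"

for enzyme in ("KpnI", "EcoRV"):
    print(enzyme, creates_unique_silent_site(original, mutated, enzyme))
```

In this hypothetical fragment two third-position changes create one KpnI site and one EcoRV site flanking a region while the translated sequence stays the same. A real construct would also need to be checked against the whole plasmid, not just the coding fragment.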

Bacillus strains excreting mutant subtilisins at position 169
were obtained as described above in Example 16. The mutant
subtilisins exhibiting substitutions of ala and ser for the
wild-type residue were recovered and assayed for changes in kinetic
features. The assay employed sAAPFpN at pH 8.6 in the same fashion
as set forth in Example 18. The results were as follows:

Table E

Enzyme     Kcat (s^-1)   Km (M)         Kcat/Km

ala+169    58            7.5 x 10^-5    8 x 10^5

ser+169    38            8.5 x 10^-5    4 x 10^5
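
The Kcat/Km column of Table E follows directly from the other two columns. As a brief sketch (the dictionary layout and function name are illustrative assumptions; the Kcat and Km values are those reported in the table), the ratio can be recomputed as follows:

```python
# Kcat (s^-1) and Km (M) as reported in Table E.
TABLE_E = {
    "ala+169": {"kcat": 58.0, "km": 7.5e-5},
    "ser+169": {"kcat": 38.0, "km": 8.5e-5},
}


def catalytic_efficiency(kcat, km):
    """Return Kcat/Km in M^-1 s^-1."""
    return kcat / km


efficiencies = {
    name: catalytic_efficiency(**values) for name, values in TABLE_E.items()
}

for name, value in efficiencies.items():
    print(f"{name}: Kcat/Km = {value:.1e} M^-1 s^-1")
```

The computed ratios are about 7.7 x 10^5 for ala+169 and about 4.5 x 10^5 for ser+169, which agree with the one-significant-figure values given in the table.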



Example 21 

Alterations in Specific Activity on a Protein Substrate

Position 166 mutants from Examples 15 and 16 were assayed for
alteration of specific activity on a naturally occurring protein
substrate. Because these mutant proteases could display altered
specificity as well as altered specific activity, the substrate
should contain sufficient different cleavage sites, i.e., acidic,
basic, neutral, and hydrophobic, so as not to bias the assay toward
a protease with one type of specificity. The substrate should also
contain no derivatized residues that result in the masking of
certain cleavage sites. The widely used substrates such as
hemoglobin, azocollagen, azocasein, dimethyl casein, etc., were
rejected on this basis. Bovine casein, α and α2 chains, was
chosen as a suitable substrate.
A 1 percent casein (w/v) solution was prepared in a 100 mM Tris
buffer, pH 8.0, 10 mM EDTA. The assay protocol is as follows:

790 µl 50 mM Tris pH 8.2
100 µl 1 percent casein (Sigma) solution
10 µl test enzyme (10-200 µg).

This assay mixture was mixed and allowed to incubate at room
temperature for 20 minutes. The reaction was terminated upon the
addition of 100 µl 100 percent trichloroacetic acid, followed by
incubation for 15 minutes at room temperature. The precipitated
protein was pelleted by centrifugation and the optical density of
the supernatant was determined spectrophotometrically at 280 nm.
The optical density is a reflection of the amount of unprecipitated,
i.e., hydrolyzed, casein in the reaction mixture. The amount of
casein hydrolyzed by each mutant protease was compared to a series
of standards containing various amounts of the wild type protease,
and the activity is expressed as a percentage of the corresponding
wild type activity. Enzyme activities were converted to specific
activity by dividing the casein hydrolysis activity by the 280 nm
absorbance of the enzyme solution used in the assay.

All of the mutants which were assayed showed less specific
activity on casein than the wild type with the exception of Asn+166
which was 26 percent more active on casein than the wild type. The
mutant showing the least specific activity was Ile+166 at 0.184 of
the wild type activity.
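
The conversion from supernatant optical density to specific activity in Example 21 can be sketched as follows. This is an illustration under stated assumptions: linear interpolation between wild type standards is assumed (the text does not say how the comparison was made), every numeric value in the usage section is hypothetical, and the function names are not part of the original disclosure.

```python
def activity_from_standards(od280, standards):
    """Interpolate casein hydrolysis activity from wild type standards.

    standards: (supernatant OD280, activity as percent of the wild type
    reference) pairs. Linear interpolation is an assumption made here.
    """
    points = sorted(standards)
    if len(points) < 2:
        raise ValueError("at least two standards are required")
    if not points[0][0] <= od280 <= points[-1][0]:
        raise ValueError("reading falls outside the standard curve")
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        if x0 <= od280 <= x1:
            return y0 + (y1 - y0) * (od280 - x0) / (x1 - x0)


def specific_activity(activity, enzyme_a280):
    """Divide activity by the 280 nm absorbance of the enzyme solution."""
    return activity / enzyme_a280


def relative_specific_activity(mutant, wild_type):
    """Express a mutant's specific activity as a fraction of the wild type."""
    return mutant / wild_type


# Hypothetical readings, for illustration only (not data from the text).
standards = [(0.10, 0.0), (0.30, 50.0), (0.50, 100.0)]
mutant_activity = activity_from_standards(0.40, standards)
mutant_specific = specific_activity(mutant_activity, enzyme_a280=1.5)
wild_type_specific = specific_activity(100.0, enzyme_a280=1.0)

print(f"mutant activity: {mutant_activity:.1f} percent of wild type")
print(
    "relative specific activity: "
    f"{relative_specific_activity(mutant_specific, wild_type_specific):.2f}"
)
```

A value reported as 0.184 of wild type, as for Ile+166 above, corresponds to the output of the last function; a mutant 26 percent more active than wild type corresponds to 1.26.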
CLAIMS

1. A method comprising: 

(a) obtaining a DNA moiety encoding at least a portion
of said precursor protein;

(b) identifying a region within the moiety;

(c) substituting nucleotides for those already
existing within the region in order to create at least one
restriction enzyme site unique to the moiety, whereby unique
restriction sites 5' and 3' to the identified region are
made available such that neither alters the amino acids
coded for by the region as expressed;

(d) synthesizing a plurality of oligonucleotides, the
5' and 3' ends of which each contain sequences capable of
annealing to the restriction enzyme sites introduced in step
(c) and which, when ligated to the moiety, are expressed as
substitutions, deletions and/or insertions of at least one
amino acid in or into said precursor protein;

(e) digesting the moiety of step (c) with restriction
enzymes capable of cleaving the unique sites;

(f) ligating each of the oligonucleotides of step (d)
into the digested moiety of step (e) whereby a plurality of
mutant DNA moieties are obtained;

and optionally the further steps of

(g) expressing each of said moieties as a mutant
protein in a suitable host;

(h) recovering the mutant proteins of step (g); and

(i) screening the step (h) mutant proteins for the
desirable characteristic.

2. The method of claim 1 wherein the restriction enzyme
sites are different.

3. The method of claim 1 or claim 2 wherein the
oligonucleotides are less than about 50 bp.
[Figure. A: map of the gene on a 1.5 kb fragment, showing the P, RBS, PRE, PRO, MAT and TERM regions. B: nucleotide sequence with the deduced amino acid sequence, running from the RBS and the PRE and PRO regions through residue 275 of the mature (MAT) protein to the terminator (TERM).]

Fig. 1.
Fig. 7.



101 



185 



GATMACCTAAATAGAfiATAAAATCATCTCAAAAAAATBGGTCTACTRAAATATTATTCCATCTATTACAATAAATTCACA6AATA6TCTTTTAAGTAA6 

-100 

fHet Arg Ser lys Lys Leu Trp lie Ser Leu Leu Phe Ala Leu Thr Leu 
TCTACTCT6AATTTTTTTAAAAGGAGAGGGTAAAGA GTG AGA AGC AAA AAA TTG TG6 ATC AGC TTG TTG TTT GCG TTA ACG TTA 

.on ^80 -70 

lie Phe Thr Net Ala Phe Ser Asn Met Ser Ala Gin Ala A1.a Gly Lys Ser Ser Thr Glu Lys Lys Tyr He Vel 
ATC TTT ACG ATG GCG TTC AGC AAC ATG TCT GC6 CAG GCT 6CC GGA AAA AGC A6T ACA GAA AAG AAA TAC ATT GTC 

-60 -50 
Gly Phe Ly$ Gin Thr Het Ser Ala Met Ser Ser Ala Lys Lys Lys Asp Val lie Ser Glu Lys Gly Gly Lys Val 
260 GGA TTT AAA CAG ACA ATG AGT GCC ATG AGT TCC GCC AAG AAA AAG GAT 8TT ATT TCT GAA AAA GGC GGA AAG GTT 

-40 -30 -20 

Bin Lvs Gin Phe Lys Tyr Val Asn Ala Ala Ala Ala Thr Leu Asp Glu Lys Ala Val Lys Glu Leu Lys Lys Asp 
335 CAA AAG CAA TTT AAG TAT GTT AAC GCG GCC GCA GCA ACA TTG GAT GAA aSa GCT GTA AAA GAA TTG AAA AAA GAT 

.10 -11 10 ' 

Pro Ser Val Ala Tyr Val Glu Glu Asp His He Ala His Glu Tyr Al* Gin Ser Val Pro Tyr Gly lie Ser Gin 
id AGC GTT GCA TAT GTG GAA GAA GAT CAT ATT GCA CAT GAA T*T GCG CAA TCT GTT CCT TAT 66* ATT TCT CAA 

20 30 32 

lie Lys Ale Pro Ala Leo H1* Ser Gin Gly Tyr Thr Gly Ser Asn Val lys Val Ala Val He Asp Ser Gly lie 
ATT AAA GCG CCG GCT CTT CAC TCT CAA GGC TAC ACA GGC TCT AAC GTA AAA GTA GCT GTT ATC GAC AGC GGA ATT 

40 SO - 60 

A» Ser Ser His Pro Asp Leu Asn Val Arg Gly 61y Ala Ser Phe Val Pro Ser Glu Thr Asn Pro Tyr Gin Asp 
GAC TCT TCT CAT CCT GAC TTA AAC GTC AG* GGC GGA GCA AGC TTC GTA CCT TCT GAA ACA AAC CCA TAC CAG GAC 

64 70 60 

Glv Ser Ser His Gly Thr His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu Gly Val Ser 
6GC AGT TCT CAC GGT ACG CAT GTA GCC GGT ACG ATT GCC GCT CTT AAT AAC TCA ATC GGT GTT CTG GGC GTT AGC 

90 100 HO 

Pro Ser Ala Ser Leu Tyr Ala Val Lys Val Leu Asp Ser Thr Gly Ser Gly Gin Tyr Ser Trp ITe He Asn 61y 
CCA AGC GCA TCA TTA Ut GCA GTA AAA'GTG CTT GAT TCA ACA GGA AGC GGC CAA TAT A$C TGfe ATT ATT AAC GGC 

IZO 130 
lie Glu Tm Ala lie Ser Asn Asn Het Asp Val lie Asn Het Ser Leu Gly Gly Pro Thr Gly Ser Thr Ala Leu 
785 ATT GAG T6& GC? ATT ill A« AAT ATG GAT GTT ATC AAC ATG AGC CTT GGC GGA CCT ACT GGT TCT ACA GCG CTG 

140 150 160 

L*s Thr Val Val Asp Lys Ala Val Ser Ser Gly He Val Val Ala Ala Ala Ala Gly Asn Glu Gly Ser Ser Gly 
860 AAA ACA GTC GTT GAC AAA GCC GTT TCC AGC GGT ATC GTC GTT GCT GCC GCA GCC GGA AAC GAA GGT TCA TCC GGA 

170 180 
Ser Thr Ser Thr Val Gly Tyr Pro Ala Lys Tyr Pro Ser Thr lie Ala Val Gly Ala Val Asn Ser Ser Asn GTn 
935 AGC ACA AGC ACA G?C GGC TAC CCT GCA AAA TAT CCT TCT ACT ATT GCA GTA GGT GCG GTA AAC AGC AGC AAC CAA 

loo 200 210 

Aro Ala Ser Phe Ser Ser Ala Gly Ser Glu Leu Asp Val Het Ala Pro Gly Val Ser He Gin Ser Thr Leu Pro 
SgS GCT TCA TTC TCC AGC GCA GGT TCT GAG CTT GAT GTG ATG GCT CCT GGC GTG TCC ATC CAA AGC ACA CTT CCT 

* 220 221 230 
1v Glv Thr Tyr Gly Ala Tyr Asn Gly Thr Ser Het Ala Thr Pro His Val Ala 61y Ala Ala Ala Leu lie Leu 
GA GGC ACT TAC GG? GCT TAT AAC GGA ACG TCC ATG GCG ACT CCT CAC GTT QCC GGA GCA GCA GCG TTA ATT CTT 

240 250 260 

Ser Lvs His Pro Thr Trp Thr Asn Ala Gin Val Arg Asp Arg Leu Glu Ser Thr Ala Thr Tyr Leu Glv Asn Ser 
1160 TCT AAG CAC CCS ACT TGG ACA AAC GCG CAA GTC CGT GAT CGf TTA GAA AGC ACT GCA ACA TAT CTT GG* AAC TCT 

270 

php Tvr Tvr Glv Lvs 61y Leu lie Asn Val Gin Ala Ala Ala Gin OC 
1235 TTC Ac TAT GGA AAA GG? TtS ATC AAC GTA CAA GCA GCT GCA CAA TAA TAGTAAAAAGAAGCAGGTTCCTCCATACCTGCTTC 

1318 TTTTTATTTGTCAGCATCCTGATGTTCCGGCGCATTCTCTTCTTTCTCCGCATGTT6AATCCGTTCCATGATCGACGGATGGCTGCCTCTGAAAATCTTC 

1418 ACAAGCACC6GAGGATCAACCT6CTCA6CCCCGTCACGGCCAAATCCTGAAACGTTTTAACACTG6CTTCTCTGTTCTCT6TC 